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ATTORNEY'S DOCKET NO. B0192/7019(ERP) 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: VAN BROECKHOVEN, Christine, et al. 

Serial No: Not Yet Assigned 

Filed: Herewith 

International Application No. : PCT/EP98/08543 

International Filing Date: 1 7 December 1 998 (1 7. 1 2.98) 

Priority Date Claimed: 1 8 December 1997 (18.1 2.97) 

For: MOOD DISORDER GENE 

COMMISSIONER FOR PATENTS 
WASHINGTON, D.C. 20231 

Sir: 

PRELIMINARY AMENDMENT 

Please enter the Preliminary Amendment as follows: 
In the Specification : 

Please amend the specification on page 1, immediately following the title by inserting 
the following: 

Related Applications 

This application claims foreign priority benefits under Title 35, U.S.C., §1 19(a)-(d) 
or §365(a),(b) of PCT application no. PCT/EP98/08543, filed December 17, 1998, and 
foreign patent application no. GB 9726804.9, filed December 18, 1997, the entire contents of 
which are incorporated herein by reference. -- 

On page 67, line 21, please add —All references disclosed herein are incorporated by 
reference in their entirety.— 



In the Claims : 

Please amend claims 4, 6, 8, 9, 11, 13, 14, 16, 17, 19, 28, 40, 41, 43, 44, 45, and 47, 
before calculating the filing fee as follows: 

4. (Amended) The use as claimed in claim 2 [or 3] wherein said YAC clone is 
961_h_9, 942_c_3, 766_f_12, 731_c_7, 907_e_l, 752-g-8 or 717_d_3. 

6. (Amended) The use as claimed in claim 1 [any preceding claim] wherein said 
mood disorder or related disorder is selected from the Diagnostic and Statistical Manual of 
Mental Disorders, version 4 (DSM-IV) taxonomy and includes mood disorders (296.XX, 
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300.4, 311, 301, 13, 295.70), schizophrenia and related disorders (295, 297.1, 298.9, 297.3, 
298.9), anxiety disorders (300.XX, 309.81, 308.3), adjustment disorders (309, XX) and 
personality disorders (codes 301. XX). 

8. (Amended) A method of identifying at least one human gene, including 
mutated or polymorphic variants thereof, which is associated with a mood disorder or related 
disorder which comprises fragmentation of a YAC clone as defined in claim 2 [any one of 
claims 2 to 4] and detection of nucleotide triplet repeats. 

9. (Amended) A method as claimed in claim 7 [or 8] wherein said repeated 
triplet is CAG or CTG. 

1 1 . (Amended) A method of identifying at least one human gene including 
mutated or polymorphic variants thereof, which is associated with a mood disorder or related 
disorder wherein said gene is present in the DNA comprised in the YAC clones as defined in 
claim 2 [any one of claims 2 to 5], which method comprises the step of detecting an 
expression product of said gene with an antibody capable of recognising a protein with an 
amino acid sequence comprising a string of at least 8 continuous glutamine residues. 

13. (Amended) A method as claimed in claim 1 1 [or claim 12] wherein said 
antibody is mAB 1C2. 

14. (Amended) A method of preparing a contig map of YAC clones of the region 
of human chromosome 18q between polymorphic markers D18S60 and D18S61 which 
comprises the steps of: 

(a) subcloning the YAC clones according to claim 2 [any one of claims 2 to 5] into 
exon trap vectors; 

(b) using the nucleotide sequences shown in any one of Figures 1 to 1 1 or any other 
known sequence tagged sequence from the YAC contig described herein, or part thereof 
consisting of not less than 14 contiguous bases or the complement thereof, to detect overlaps 
among the cosmid vectors, and 

(c) constructing a cosmid contig map of a YAC clone of said region. 

16. (Amended) A method of identifying at least one human gene or mutated or 
polymorphic variants thereof which is associated with a mood disorder or related disorder 
which comprises the steps of: 
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(a) subcloning the YAC clones according to claim 2 [any one of claims 2 to 5] into a 
cosmid, B AC, PAC or other vector; 

(b) using the nucleotide sequences shown in any one of Figures 1 to 1 1 or any other 
known sequence tagged sequence from the YAC contig described herein, or part thereof 
consisting of not less than 14 contiguous bases or the complement thereof, to defect overlaps 
amongst the subclones and construct a map thereof; 

(c) identifying the position of genes within the subcloned DNA by one or more of 
CpG island identification, zoo-blotting, hybridization of said subcloned DNA to a cDNA 
library or a Northern blot of mRNA from a panel of culture cell lines; 

(d) detecting differences between said genes and equivalent regions of the DNA of an 
individual afflicted with a mood disorder or related disorder; and 

(e) identifying said gene which, if defective, is associated with said mood disorder or 
related disorder. 

17. (Amended) An isolated human gene, including mutated or polymorphic 
variants thereof, which is associated with a mood disorder or related disorder which is 
obtainable by the method according to claim 7 [any of claims 7 to 13, 15 or 16]. 

19. (Amended) A cDNA encoding the protein of claim 18 which is obtainable by 
the method of claim 7 [any one of claims 7 to 13, 15 or 16]. 

28. (Amended) A method as in claim 26 [or 27] which comprises the steps of: 

a) obtaining a DNA sample from said individual; 

b) providing primers suitable for the amplification of a nucleotide sequence 
comprised in the sequence shown in any one of Figures 15a, 16a, 17a or 18a said primers 
flanking the trinucleotide repeats comprised in said sequence; 

c) applying said primers to the said DNA sample and carrying out an amplification 
reaction; 

d) carrying out the same amplification reaction on a DNA sample from a control 
individual; and 

e) comparing the results of the amplification reaction for the said individual and for 
the said control individual; 

wherein the presence of an amplified fragment from said individual which is bigger in 
size from said individual which is bigger in size from that of said control individual is an 
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indication of the presence of a susceptibility to a mood disorder or related disorder of said 
individual. 

40. (Amended) An expression vector which comprises a sequence of nucleotides 
as claimed in claim 21 [claims 21 or 22]. 

4 1 . (Amended) A reporter plasmid which comprises the promoter region of a 
nucleic acid molecule as claimed in claim 21 [or 22] positioned upstream of a reporter gene 
which encodes a reporter molecule so that expression of said reporter gene is controlled by 
said promoter region. 

43. (Amended) An eukaryotic cell or multicellular tissue or organism comprising 
a transgene encoding a protein as claimed in claim 23 [claims 23 or 24], 

44. (Amended) A method for determining if a compound is an enhancer or 
inhibitor of expression of a gene associated with a mood disorder or related disorder which 
comprises the steps of: 

a) contacting a cell as claimed in claim 42 with said compound; 

b) detecting and/or quantitatively evaluating the presence of any mRNA transcript 
corresponding to a nucleic acid as claimed in claim 21 [or 22]; and 

c) comparing the level of transcription of said nucleic acid with the level of 
transcription of the same nucleic acid in a cell as claimed in claim 42 not exposed to said 
compound [;]. 

45. (Amended) A method for determining if a compound is an enhancer or 
inhibitor of expression of a gene associated with a mood disorder or related disorder which 
comprises the steps of: 

a) contacting a cell as claimed in claim 42 with said compound; 

b) detecting and/or quantitatively evaluating the expression of a protein as claimed in 
claim 23 [claims 23 or 24]; and 

c) comparing the level of expression of said protein with that of the same protein in a 
cell not exposed to said compound. 

47. (Amended) A compound identified as an enhancer or an inhibitor of the 
expression of a gene associated with a mood disorder or related disorder by a method as 
claimed in claim 44 [claims 44 to 46]. 
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If any other information is needed, please contact the undersigned attorney by phone 
(617-720-3500, Ext. 343) to expedite the further prosecution of this patent application. 

Respectfully submitted, 

Elizabeth R. Plumer, Reg. No. 36,637 
WOLF, GREENFIELD & SACKS, P.C. 
600 Atlantic Avenue 
Boston, MA 02210 
Tel. (617)720-3500 
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The invention is concerned with the determination 
of genetic factors associated with psychiatric health 
with particular reference to a human gene or genes 
which contributes to or is responsible for the 
manifestation of a mood disorder or a related disorder 
in affected individuals. In particular , although not 
exclusively, the invention provides a method of 
identifying and characterising such a gene or genes 
from human chromosome 18, as well as genes so 
identified and their expression products. The 
invention is also concerned with methods of 
determining the genetic susceptibility of an 
individual to a mood disorder or related disorder- By 
mood disorders or related disorders is meant the 
following disorders as defined in the Diagnostic and 
Statistical Manual of Mental Disorders, version 4 
(DSM-IV) taxonomy (DSM-IV codes in parenthesis):- mood 
disorders (296, XX, 300,4, 311, 301.13, 295.70), 
schizophrenia and related disorders (295. XX, 
297.1,298.8, 297.3, 29S.9), anxiety disorders (300. XX, 
309.31,308.3), adjustment disorders (309. XX) and 
personality disorders (codes 301. XX). 

The methods of the invention are particularly 
exemplified in relation to genetic factors associated 
with a family of mood disorders known as Bipolar (BP) 
spectrum disorders. 

Bipolar disorder (BP) is a severe psychiatric 
condition that is characterized by disturbances in 
mood, ranging from an extreme state of elation (mania) 
to a severe state of dysphoria (depression) . Two types 
of bipolar illness have been described: type I BP 
illness (BPI) is characterized by major depressive 
episodes alternated with phases of mania, and type II 
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BP illness (BPII) , characterized by major depressive 
episodes alternating with phases of hypomania. 
Relatives of BP probands have an increased risk for 
BP, unipolar disorder (patients only experiencing 
5 depressive episodes; UP) , cyclothymia (minor 

depression and hypomania episodes; CY) as well as for 
schizoaffective disorders of the manic (SAm) and 
depressive (SAd) type. Based on these observations BP, 
CY, UP and SA are classified as BP spectrum disorders. 
10 The involvement of genetic factors in the etiology of 
BP spectrum disorders was suggested by family, twin 
and adoption studies (Tsuang and Faraone (1990), The 
Genetics of Hood Disorders , Baltimore, The John 
Hopkins University Press) * However, the exact pattern 
15 of transmission is unknown* In some studies, complex 
segregation analysis supports the existence of a 
single major locus for BP (Spence et al . (1995), Am J* 
Med, Genet (Neuropsych. Genet,) 6.0 pp 370-376), Other 
researchers propose a liability-threshold-model, in 
20 which the liability to develop the disorder results 

from the additive combination of multiple genetic and 
environmental effects (McGuffin et al . (1994), 
Affective Disorders; Seminars in Psychiatric Genetics 
Gaskell, London pp 110-127) . 
25 Due to the complex mode of inheritance, 

parametric and nonparametric linkage strategies are 
applied in families in which BP disorder appears to be 
transmitted in a Mendelian fashion. Early linkage 
findings on chromosomes llpl5 (Egeland et al . (1937), 
30 Nature 325 pp 783-787) and Xq27-q2S (Mendlewicz et al , 
(1937) The Lancet l pp 1230 -1232/ Baron et al. (1987) 
Nature 326 pp 289-292) have been controversial and 
could initially not be replicated (Kelsoe et al. 
(1989) Nature 242 pp 238-243; Baron et ai . (1993) 
35 Nature Genet 3 pp 49-SS) . With the development of a 
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human genetic map saturated with highly polymorphic 
markers and the continuous development of data 
analysis techniques, numerous new linkage searches 
were started. In several studies , evidence or 
5 suggestive evidence for linkage to particular regions 
on chromosomes 4, 12 , 18 , 21 and X was found 
(Blackwood et al* (1996) Nature Genetics 12 pp 427- 
430, Craddock et al. (1994) Brit J. Psychiatry 164 pp 
355-358, Berrettini et al . (1994), Proc Natl Acad Sci 
10 USA 91 pp 5918-5921, Straub et al . (1994) Nature 
Genetics 8 pp 291-296 and Pekkarinen et al . (1995) 
Genome Research 5 pp 105-115) . In order to test the 
validity of the reported linkage results , these 
findings have to be replicated in other, independent 
15 studies. 

Recently, linkage of bipolar disorder to the 
pericentromeric region on chromosome 18 was reported 
(Berrettini et al. 1994). Also a ring chromosome 18 
with break-points and deleted regions at 18pter-pil 
20 and 18q23-qter was reported in three unrelated 
patients with BP illness or related syndromes 
(Craddock et al. 1994). The chromosome 18p linkage 
was replicated by stine et al . (1995) Am J Hum Genet 
57 pp 1384-1394, who also reported suggestive evidence 
25 for a locus on I8q21.2-g21.32 in the same study. 

Interestingly, Stine et al- observed a parent-of- 
origin effect; the evidence of linkage was the 
strongest in the paternal pedigrees, in which the 
proband's father or one of the proband's father's sibs 
30 is affected. 

In an independent replication study, the present 
inventors tested linkage with chromosome 18 markers in 
10 Belgian families with a bipolar proband. To 
localize causative genes the linkage analysis or 
35 likelihood method was used in these families. This 
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method studies within a family the segregation of a 
defined disease phenotype with that of polymorphic 
genetic markers distributed in the human genome* The 
likelihood ratio of observing cosegregation of the 
5 disease and a genetic marker under linkage versus no 

linkage is calculated and the log of this ratio or the 
log of the odds is the LCD score statistic z. A LOD 
score of 3 (or likelihood ratio of 1000 or greater) is 
taken as significant statistical evidence for linkage. 
10 In the inventors 1 study no evidence for linkage to the 
pericentromeric regions was found, but in one of the 
families, MAD31, a Belgian family of a BPII proband, 
suggestive linkage was found with markers located at 
18q21.33-q23 (De bruyn et al . (1996) Biol Psychiatry 
15 39 pp 679-683) . Multipoint linkage analysis gave the 
highest LOD score in the interval between STR (Short 
Tandem Repeats) polymorphisms D18S51 and D18S61, with 
a maximum multipoint LOD score of +1.34. Simulation 
studies indicated that this LOD score is within the 
20 range of what can be expected for a linked marker 
given the information available in the family. 
Likewise, an affected sib-pair analysis also rejected 
the null-hypothesis of nonlinkage for several of the 
markers tested. Two other groups also found evidence 
25 for linkage of bipolar disorder to 18q (Freimer et a J . 
(1996) Nature Genetics 12 pp 436-441, coon et al . 
(1996) Biol Psychiatry 39 pp 689 to 696) . Although 
the candidate regions in the different studies do not 
entirely overlap/ they all suggest the presence of a 
30 susceptibility locus at 18q21-q23, 

The inventors have now carried out further 
investigations into the lSq chromosomal region in 
family MAD31. By analysis of cosegregation of bipolar 
disease in MAD31 with twelve STR polymorphic markers 
35 previously located between the aforementioned markers 
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D18S51 and D18S61 and subsequent LOD score analysis as 
described above, the inventors have further refined 
the candidate region of chromosome 18 in which a gene 
associated with mood disorders such as bipolar 
5 spectrum disorders may be located and have constructed 
a physical map. The region in question may thus be 
used to locate, isolate and sequence a gene or genes 
which influences psychiatric health and mood. 

The inventors have also constructed a YAC (yeast 

10 artificial chromosome) contig map of the candidate 

region to determine the relative order of the twelve 
STR markers mapped by the cosegregational analysis and 
they have identified seven clones from the YAC library 
incorporating the candidate region. 

15 A number of procedures can be applied to the 

identified YAC clones and, where applicable, to the 
DNA of an individual afflicted with a mood disorder as 
defined herein, in the process of identifying and 
characterising the relevant gene or genes. For 

20 example, the inventors have used YAC clones spanning 

the region of interest in chromosome 18 to identify by 
CAG or CTG fragmentation novel genes that are 
allegedly involved in the manifestation of mood 
disorders or related disorders* 

25 Other procedures can also be applied to the said 

YAC clones to identify candidate genes as discussed 
below. 

Once candidate genes have been identified it is 
possible to assess the susceptibility of an individual 
30 to a mood disorder or related disorder by detecting 

the presence of a polymorphism associated with a mood 
disorder or related disorder in such genes. 

Accordingly, in a first aspect the present 
35 invention comprises the use of an 8.9 cM region of 
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human chromosome 18q disposed between polymorphic 
markers D18S68 and D18S979 or a fragment thereof for 
identifying at least one human gene, including mutated 
and polymorphic variants thereof , which is associated 
5 with mood disorders or related disorders as defined 
above. As will be described below, the present 
inventors have identified this candidate region of 
chromosome 18q for such a gene, by analysis of 
cosegregation of bipolar disease in family MAD31 with 
10 12 STR polymorphic markers previously located between 
D18S51 and D18S61 and subsequent LCD score analysis. 

In a second aspect the invention comprises the 
use of a YAC clone comprising a portion of human 

15 chromosome l8q disposed between polymorphic markers 
D18S60 and D18S61 for identifying at least one human 
gene, including mutated or polymorphic variants 
thereof, which is associated with mood disorders or 
related disorders as defined above. D18S60 is close 

20 to D18S51 so the particular YAC clones for use are 

those which have an artificial chromosome spanning the 
candidate region of human chromosome l8q between 
polymorphic markers D18S51 and D13S61 as identified by 
the present inventors in their earlier paper {De bruyn 

25 et al . (1996) ) . 

Particular YACs covering the candidate region 
which may be used in accordance with the present 
invention are 961„h.9, 942_c_3, 766. f, 12, 731.C.7, 
9G7_$ w i, 752-g-8 and 717, d, 3, preferred ones being 961, 

30 h_9, 766, f ^12 and 907.e.l since these have the minimum 

tiling path across the candidate region- Suitable YAC 
clones for use are those having an artificial 
chromosome spanning the refined candidate region 
between D18S68 and D13S979. 
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There are a number of methods which can he 
applied to the candidate regions of chromosome 18q as 
defined above, whether or not present in a YAC, to 
identify a candidate gene or genes associated with 
5 mood disorders or related disorders. For example, it 
has previously been demonstrated that an apparent 
association exists between the presence of 
trinucleotide repeat expansions (TRE) in the human 
genome and the phenomenon of anticipation of mood 

10 disorders (Lindblad et al. (1995), Neurobiology of 
Disease 2: 55-62 and O'Donovan et al - (1995), Nature 
Genetics 10: 380-381). 

Accordingly, in a third aspect the present 
invention comprises a method of identifying at least 

15 one human gene, including mutated and polymorphic 
variants thereof, which is associated with a mood 
disorder or related disorder as defined herein which 
comprises detecting nucleotide triplet repeats in the 
region of human chromosome 18q disposed between 

20 polymorphic markers D18S68 and D18S979- 

An alternative method of identifying said gene or 
genes comprises fragmenting a YAC clone comprising a 
portion of human chromosome I8q disposed between 

25 polymorphic markers D18S60 and D18S61, for example one 
or more of the seven aforementioned VAC clones, and 
detecting any nucleotide triplet repeats in said 
fragments. Nucleic acid probes comprising at least 5 
and preferably at least 10 CTG and/or CAG triplet 

30 repeats are a suitable means of detection when 

appropriately labelled. Trinucleotide repeats may 
also be determined using the known RED (repeat 
expansion detection) system (Shalling et al.(1993), 
Nature Genetics 4 pp 135-139). 

3 5 In a fourth embodiment the invention comprises a 
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method of identifying at least one gene, including 
mutated and polymorphic variants thereof, which is 
associated with a mood disorder or related disorder 
and which is present in a YAC clone spanning the 
5 region of human chromosome 18q between polymorphic 

markers D18S60 and Di$S61, the method comprising the 
step of detecting the expression product of a gene 
incorporating nucleotide triplet repeats by use of an 
antibody capable of recognising a protein with an 

10 amino acid sequence comprising a string of at least 8; 
but preferably at least 12, continuous glutamine 
residues. Such a method may be implemented by 
subcloning YAC DNA, for example from the seven 
aforementioned YAC clones, into a human DNA expression 

15 library, A preferred means of detecting the relevant 
expression product is by use of a monoclonal antibody, 
in particular mAB 1C2, the preparation and properties 
of which are described in International Patent 
Application Publication No wo 97/17445. 

20 

As will be described in detail below, in order to 
identify candidate genes containing triplet repeats, 
the inventors have carried out direct CAG or ctg 
fragmentation of YACs 961. h.9, 766,f.l2 and 907. e„l, 

25 comprising a portion of human chromosome 18g disposed 
between polymorphic markers D18S60 and D18S61, and 
have identified a number of sequences containing CAG 
or CTG repeats, whose abnormal expansion may be 
involved in genetic susceptibility to a mood disorder 

30 or related disorder* 

Accordingly, in a fifth aspect, the invention 
provides a nucleic acid comprising the sequence of 
nucleotides shown in any one of Figures 15a, 16a, 17a, 
or 13a. 

35 
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In a further aspect, the invention provides a 
protein comprising an amino acid sequence encoded by 
the sequence of nucleotides shown in any one of 
Figures 15a, 16a, 17a, or 18a* 

5 

In yet a further aspect the invention provides a 
mutated nucleic acid comprising a sequence of 
nucleotides which differ from the sequence of 
nucleotides shown in any one of Figures iSa, 16a, 17a, 
10 or 18a only in the extent of trinucleotide repeats. 

Also provided by the invention is a mutated protein 
comprising an amino acid sequence encoded by a 
sequence of nucleotides which differ from the sequence 
15 of nucleotides shown in any one of Figures 15a, 16a, 
17a, or 18a only in the extent of trinucleotide 
repeats , 

It is to be understood that the invention also 
20 contemplates nucleotide sequences having. at least 75% 
and preferably at least 80% homology with any of the 
sequences described above and having functional 
identity with any of said sequences. The homology is 
calculated as described by Altschul et al • (1997) 
25 Nucleic Acids Res. 25: 3389-3402, Karlin et al. (1990) 
Proc Natl Acad Sci USA 87: 2264-68 and Karlin et al . 
(1993) Proc Natl Acad Sci USA 90: 5873-5877. Also 
contemplated are amino acid sequences which differ 
from the above described sequences only in 
30 conservative amino acid changes. Suitable changes are 
well known to those skilled in the art. 

Knowledge of the sequences described above can be 
used to design assays to determine the genetic 
35 susceptibility of an individual to a mood disorder or 
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related disorder. 

Accordingly, in a further aspect the invention 
provides a method for determining the susceptibility 
of an individual to a mood disorder or related 
5 disorder which comprises the steps of; 

a) obtaining a DNA sample from said 
individual; 

10 b) providing primers suitable for the 

amplification of a nucleotide sequence comprised in 
the sequence shown in any one of Figures 15a, 16a, 17a 
or 18a said primers flanking the trinucleotide repeats 
comprised in said sequence; 

15 

c) applying said primers to the said DNA 
sample and carrying out an amplification reaction; 

d) carrying out the same amplification 

2 0 reaction on a DNA sample from a control individual ; 
and 

e) comparing the results of the 
amplification reaction for the said individual and for 

25 the said control individual; 

wherein the presence of an amplified fragment 
from said individual which is bigger in size from that 
of said control individual is an indication of the 

3D presence of a susceptibility to a mood disorder or 
related disorder of said individual. 
By control individual is meant an individual who is 
not affected by a mood disorder or related disorder 
and does not have a family history of mood disorders 

35 or related disorders. 
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Preferable primers to use in this method are those 
shown in Figure 15b, 16b, 17b or 18b but other 
suitable primers may be utilised* 

5 In a further aspect the invention provides a 

method of determining the susceptibility of an 
individual to a mood disorder or related disorder 
which method comprises the steps of : 

10 a) obtaining a protein sample from said 

individual; and 

b) detecting the presence of a protein 
comprising an amino acid sequence encoded by a 
15 sequence of nucleotides which differ from the sequence 
of nucleotides shown in any one of Figures 15a, 16a , 
17a, or 18a only in the extent of trinucleotide 
repeats 

20 wherein the presence of said protein is an 

indication of the presence of a susceptibility to a 
mood disorder or related disorder of said individual. 

Preferably, the foresaid protein is detected by 
utilising an antibody that is capable of recognising a 

25 string of at least 3 continuous glutamines as, for 
example, the mAB 1C2 antibody. 

The nucleic acids molecules according to the 
invention may be advantageously included in an 

3 0 expression vector, which may be introduced into a host 
cell of prokaryotic or eukaryotic origin. Suitable 
expression vectors include plasmids, which may be used 
to express foreign DNA in bacterial or eukaryotic host 
cells, viral vectors, yeast artificial chromosomes or 

35 mammalian artificial chromosomes. The vector may be 
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transfected or transformed into host cells using 
suitable methods known in the art such as, for 
example, electroporation, microinjection, infection, 
lipoinfection and direct uptake. Such methods are 
5 described in more detail, for example, by Sambrook et 
ai., "Molecular Cloning: A Laboratory Manual", 2nd ed. 
(1989) and by Ausbel et al . "Current Protocols in 
Molecular Biology", (1994). 

10 Also provided by the invention is a host cell, 

tissue or organism comprising the expression vector 
according to the invention. The invention further 
provides a transgenic host cell, tissue or organism 
comprising a transgene capable of encoding the 

15 proteins of the invention, which may comprise a 

genomic DNA or a cDNA. The transgene may be present in 
the trangenic host cell, tissue or organism either 
stably integrated into the genome or in an extra 
chromosomal state. 

20 

A nucleic acid molecule comprising a nucleotide 
sequence shown in any one of Figures 15a, 16a, 17a or 
18a as well as the protein encoded by it may be 
therapeutically used in the treatment of mood 

25 disorders or related disorders in patients which 

present a trinucleotide repeat expansion (TRE) in at 
least one of the foresaid sequences. 

Accordingly, in another of its aspects the 
invention provides the above described nucleic acid 

30 molecules and proteins for use as medicaments for the 
treatment of individuals with a mood disorder or 
related disorder. Preferably, the nucleic acid or the 
protein is present in an appropriate carrier or 
delivery vehicle. As an example, the nucleic acid 

35 inserted into a vector, for example a plasmid or a 
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viral vector, may be transfected into a mammalian cell 
such as a somatic cell or a mammalian germ line cell, 
as described above- The cell to be transfected can be 
present in a biological sample obtained from the 
5 patient, for example blood or bone marrow, or can be 
obtained from cell culture. After transfection the 
sample may be returned or readministered to a patient 
according to methods known to those practised in the 
art, for example, methods as described in Kasid et 

10 al., Proa. Natl. Acad- Sci. USA (1990) 87:473; 

Rosenberg et al . (1990) New Eng. J. Med. 323: 570 ; 
Williams et al . (1994) Nature 310: 476; Dick et al . 
(1985) Cell 42:71; Keller et al . (1985) Nature 318: 
149 and Anderson et al. (1994) US Patent N. 5,399,346. 

15 There are a number of viral vectors known to 

those skilled in the art which can be used to 
introduce the nucleic acid into mammalian cells, for 
example retroviruses, parvoviruses, coronaviruses, 
negative strand RNA viruses such as picornaviruses or 

20 alphaviruses and double stranded DNA viruses including 
adenoviruses, herpesviruses such as Herpes Simplex 
virus types 1 and 2, Epstein-Barr virus or 
cytomegalovirus and poxviruses such as vaccinia 
fowlpox or canarypox. Other viruses include, for 

25 example, Norwalk viruses, togaviruses, flavi viruses, 
reoviruses, papovaviruses, hepadnaviruses and 
hepatitis viruses. 

A preferred method to introduce nucleic acid that 
encodes the desired protein into cells is through the 

30 use of engineered viral vectors. These vectors 
provide a means to introduce nucleic acids into 
cycling and quiescent cells and have been modified to 
reduce cytotoxicity and to improve genetic stability. 
The preparation and use of engineered Herpes simplex 

35 virus type 1 (O.K. Krisky, et al , (1997) Gene Therapy 
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4(10): 1120-1125), adenoviral (A* Amalfitanl, et 
al. (1998) Journal of Virology 72 (2) : 926-933) , 
attenuated lentiviral (R. 2ufferey, et al . , Nature 
Biotechnology (1997) 15(9)871-875) and 
5 adenoviral/retroviral chimeric (M- Feng, et al, Nature 
Biotechnology (1997) 15 (9) :866-870) vectors are known 
to the skilled artisan. 

The protein may be administered using methods 
known in the art. For example, the mode of 

10 administration is preferably at the location of the 

target cells. The administration can be by injection. 
Other modes of administration (parenteral, mucosal , 
systemic, implant, intraperitoneal, etc.) are 
generally known in the art* The agents can, 

15 preferably, be administered in a pharmaceutical^ 
acceptable carrier, such as saline, sterile water, 
Ringer's solution and isotonic sodium chloride 
solution. 

20 In yet another of its aspects the invention 

provides assay methods for identifying compounds that 
are able to enhance or inhibit the expression of the 
proteins of the invention. These assays can be 
conducted, for example, by transfecting a nucleic acid 

25 of the invention into host cells and then comparing 

the levels of mRNA transcript or the levels of protein 
expressed from said nucleic acids in the presence or 
absence of the compound. 

Different methods, well known to those skilled in the 
30 art can be employed in order to measure transcription 
or expression levels. 

Alternatively, it is possible to identify compounds 
that modulate transcription by using a reporter gene 
assay of the type well known in the art. In such an 
3 5 assay a reporter plasmid is constructed in which the 
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promoter of a gene, whose levels of transcription are 
to be monitored, is positioned upstream of a gene 
capable of expressing a reporter molecule. The 
reporter molecule is a molecule whose level of 
5 expression can be easily detected and may be either 

the transcript of the reporter gene or a protein with 
characteristics that allow it to be detected. For 
example, the molecule may be a fluorescent protein 
such as green fluorescent protein (GFP) . 
10 Compound assays may be conducted by introducing 

the reporter plasmid described above into an 
appropriate host cell and then measuring the amount of 
reporter molecule expressed in the presence or absence 
of the compound to be tested . 

15 

The invention also relates to compounds 
identified by the above mentioned methods. 

Further embodiments of the present invention 

20 relate to methods of identifying the relevant gene or 
genes which involve the sub-cloning of YAC DNA as 
defined above into vectors such as BAC (bacterial 
artificial chromosome) or PAC (Pi or phage artificial 
chromosome) or cosmid vectors such as exon-trap cosmid 

25 vectors. The starting point for such methods is the 
construction of a contig map of the region of human 
chromosome ISq between polymorphic markers D18S60 and 
D18S61. To this end the present inventors have 
sequenced the end regions of the fragment of human DNA 

30 in each of the seven aforementioned YAC clones and 
these sequences are disclosed herein- Following 
subcloning of YAC DNA into other vectors as described 
above, probes comprising these end sequences or 
portions thereof, in particular those sequences shown 

35 in Figures 1 to 11 herein, together with any known 
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sequenced tagged site (STS) in this region, as 
described in the YAC clone contig shown herein , as can 
be used to detect overlaps between said subclones and 
a contig map can be constructed. Also the known 
5 sequences in the current YAC contig can be used for 
the generation of contig map subclones. 

One route by which a gene or genes which is 
associated with a mood disorder or associated disorder 
10 can be identified is by use of the known technique of 
exon trapping. 

This is an artificial RNA splicing assay, most 
often making use in current protocols of a specialized 
exon-trap ccsmid vector. The vector contains an 
15 artificial minigene consisting of a segment of the 

SV40 genome containing an origin of replication and a 
powerful promoter sequence, two spl icing-competent 
exons separated by an intron which contains a multiple 
cloning site and an SV40 polyadenylation site. 
20 The YAC DNA is subcloned in the exon-trap vector 

and the recombinant DNA is transfected into a strain 
of mammalian cells* Transcription from the SV40 
promoter results in an RNA transcript which normally 
splices to include the two exons of the minigene. If 
25 the cloned DNA itself contains a functional exon, it 
can be spliced to the exons present in the vector's 
minigene. Using reverse transcriptase a cDNA copy can 
be made and using specific PCK primers, splicing 
events involving exons of the insert DNA can be 
30 identified. Such a procedure can identify coding 

regions in the YAC DNA which can be compared to the 
equivalent regions of DNA from a person afflicted with 
a mood disorder or related disorder to identify the 
relevant gene, 

35 Accordingly, in a further aspect the invention 
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comprises a method of identifying at least one human 
gene, including mutated variants and polymorphisms 
thereof , which is associated with a stood disorder or 
related disorder which comprises the steps of: 

5 

(a) transfecting mammalian cells with exon trap 
costaid vectors prepared and mapped as described above; 

(b) culturing said mammalian cells in an 
10 appropriate medium; 

(c) isolating RNA transcripts expressed from the 
SV40 promoter; 

15 (d) preparing cDNA from said RNA transcripts; 

(e) identifying splicing events involving exons 
of the DNA subcloned into said exon trap cosmid 
vectors to elucidate positions of coding regions in 

20 said subcloned DNA; 

(f) detecting differences between said coding 
regions and equivalent regions in the DNA of an 
individual afflicted with said mood disorder or 

25 related disorder; and 

(g) identifying said gene or mutated or 
polymorphic variant thereof which is associated with 
said mood disorder or related disorders. 

30 

As an alternative to exon trapping the YAC DNA 
may be subcloned into BAG, PAC, cosmid or other 
vectors and a contig map constructed as described 
above. There are a variety of known methods available 
35 by which the position of relevant genes on the 
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subcloned DNA can be established as follows: 

(a) cDNA selection or capture (also called direct 
selection and cDNA selection) : this method involves 

5 the forming of genomic DNA/ c DNA heteroduplexes by 
hybridizing a cloned DNA (e*g. an insert of a YAC 
DNA), to a complex mixture of cDNAs , such as the 
inserts of all cDNA clones from a specific (e.g. 
brain) cDNA library. Related sequences will hybridize 
10 and can be enriched in subsequent steps using biotin- 
streptavidine capturing and PCR (or related 
techniques) ; 

(b) hybridization to mRNA/cDNA: a genomic clone 
15 (e.g. the insert of a specific ccsmid) can be 

hybridized to a Northern blot of mRNA from a panel of 
culture cell lines or against appropriate (e.g. brain) 
cDNA libraries. A positive signal can indicate the 
presence of a gene within the cloned fragment; 

20 

(c) CpG island identification; CpG or HTF islands 
are short (about 1 kb) hypomethylated GC-rich (> 60%) 
sequences which are often found at the 5' ends of 
genes. CpG islands often have restriction sites for 

25 several rare-cutter restriction enzymes. Clustering 
of rare-cutter restriction sites is indicative of a 
CpG island and therefore of a possible gene. CpG 
islands can be detected by hybridization of a DNA 
clone to Southern blots of genomic DNA digested with 

3 0 rare-cutting enzymes, or by island-rescue PCR 

(isolation of CpG islands from YACs by amplifying 
sequences between islands and neighbouring Ala- 
repeats) ; 

35 (d) zoo-blotting: hybridizing a DNA clone (e.g. 
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the insert of a specific cosmid) at reduced stringency 
against a southern blot of genomic DNA samples from a 
variety of animal species. Detection of hybridization 
signals can suggest conserved sequences, indicating a 
5 possible gene. 

Accordingly, in a further aspect the invention 
comprises a method of identifying at least one human 
gene including mutated and polymorphic variants 
10 thereof which is associated with a mood disorder or 
related disorder which comprises the steps of: 

(a) subcloning the YAC DNA as described above 
into a cosmid, BAG, PAC or other vector; 

15 

(b) using the nucleotide sequences shown in any 
one of Figures 1 to 11 or any other sequenced tagged 
site (STS) in this region as in the YAC clone contig 
described herein, or part thereof consisting of not 

20 less than 14 contiguous bases or the complement 

thereof, to detect overlaps amongst the subclones and 
construct a map thereof; 

(c) identifying the position of genes within the 
25 subcloned DNA by one or more of CpG island 

identification, zoo-blotting, hybridization of the 
subcloned DNA to a cDNA library or a Northern blot of 
mRNA from a panel of culture cell lines; 

30 (d) detecting differences between said genes and 

equivalent region of the DNA of an individual 
afflicted with a mood disorder or related disorder; 
and 

35 (e) identifying said gene which is associated 
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with said mood disorders or related disorders. 

If the cloned YAC DNA is sequenced, computer 
analysis can be used to establish the presence of 
5 relevant genes. Techniques such as homology searching 
and exon prediction may be applied. 

Once a candidate gene has been isolated in 
accordance with the methods of the invention more 
detailed comparisons may be made between the gene from 

10 a normal individual and one afflicted with a mood 
disorder such as a bipolar spectrum disorder. For 
example, there are two methods, described as "mutation 
testing", by which a mutation or polymorphism in a DNA 
sequence can be identified » In the first the DNA 

15 sample may be tested for the presence or absence of 
one specific mutation but this requires knowledge of 
what the mutation might be. In the second a sample of 
DNA is screened for any deviation from a control 
(normal) DNA. This latter method is more useful for 

20 identifying candidate genes where a mutation is not 
identified in advance* 

In addition, the following techniques may be 
further applied to a gene identified by the above- 
25 described methods to identify differences between 
genes from normal or healthy individuals and those 
afflicted with a mood disorder or related disorder: 

(a) Southern blotting techniques: a clone is 
30 hybridized to nylon membranes containing genomic DNA 
digested with different restriction enzymes of 
patients and healthy individuals. Large differences 
between patients and healthy individuals can be 
visualized using a radioactive labelling protocol ; 

35 
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(b) heteroduplex mobility in polyacrylamide gels: 
this technique is based on the fact that the mobility 
of heteroduplexes in non-denaturing polyacrylamide 
gels is less than the mobility of homoduplexes . It 

5 is most effective for fragments under 2 00 bp; 

(c) single-strand conformational polymorphism 
analysis (SSCP or SSCA) : single stranded DNA folds up 
to form complex structures that are stabilized by weak 

10 intramolecular bonds. The electrophoretic mobilities 
of these structures on non-denaturing polyacrylamide 
gels depends on their chain lengths and on their 
conformation; 

15 (d) chemical cleavage of mismatches (CCM) : a 

radiolabeled probe is hybridized to the test DNA 7 and 
mismatches detected by a series of chemical reactions 
that cleave one strand of the DNA at the site of the 
mismatch. This is a very sensitive method and can be 

20 .applied to kilobase-length samples; 

(e) enzymatic cleavage of mismatches: the assay 
is similar to CCM, but the cleavage is performed by 
certain bacteriophage en2ymes. 

25 

(f) denaturing gradient gel electrophoresis: in 
this technique, DNA duplexes are forced to migrate 
through an electrophoretic gel in which there is a 
gradient of increasing amounts of a denaturant 

30 (chemical or temperature). Migration continues until 
the DNA duplexes reach a position on the gel wherein 
the strands melt and separate, after which the 
denatured DMA does not migrate much further. A single 
base pair difference between a normal and a mutant DNA 

35 duplex is sufficient to cause them to migrate to 
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different positions in the gel; 
(g) direct DNA sequencing. 

5 It will be appreciated that with respect to the 

methods described herein, in the step of detecting 
differences between coding regions from the YAC and 
the DNA of an individual afflicted with a mood 
disorder or related disorder, the said individual may 
10 be anybody with the disorder and not necessary a 
member of family MAD31. 

In accordance with further aspects the present 
invention provides an isolated human gene and variants 
15 thereof associated with a mood disorder or related 

disorder and which is obtainable by any of the above 
described methods, an isolated human protein encoded 
by said gene and a cDNA encoding said protein. 

20 In the experimental report which follows 

reference will be made to the following figures: 

FIGURE 1 shows a sequence of nucleotides which is 
the left arm end-sequence of YAC 766_f„12; 

25 

FIGURE 2 shows a sequence of nucleotides which is 
a right arm end-sequence of YAC 766. f. 12; 

FIGURE 3 shows a sequence of nucleotides which is 
30 a left arm end-sequence of YAC 717,d.3; 

FIGURE 4 shows a sequence of nucleotides which is 
a right arm end-sequence of YAC 7i7,d.3; 

35 FIGURE 5 shows a sequence of nucleotides which is 
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a right arm end-sequence of YAC 7 31.C.7; 

FIGURE 6 shows a sequence of nucleotides which is 
a left arm end-sequence of YAC 752_g.8; 

5 

FIGURE 7 shows a sequence of nucleotides which is 
a left arm end-sequence of YAC 942.C.3; 

FIGURE S shows a sequence of nucleotides which is 
10 a right arm end-sequence of YAC 942. c.3; 

FIGURE 9 shows a sequence of nucleotides which is 
a left arm end-sequence of YAC 961X9; 

15 FIGURE 10 shows a sequence of nucleotides which 

is a right arm end-sequence of YAC 961. h M 9; 

FIGURE 11 shows a sequence of nucleotides which 
is a left arm end-sequence of YAC 907. e.l; 



20 



FIGURE 12 shows a pedigree of family MAD31; 



FIGURE 13 shows the haplotype analysis for family 
MAD13 . Affected individuals are represented by filled 
25 diamonds, open diamonds represent individuals who were 
asymptomatic at the last psychiatric evaluation. Dark 
gray bars represent markers for which it cannot be 
deduced if they are recombinant; and 

30 FIGURE 14 shows the YAC contig map of the region 

of human chromosome 13 between the polymorphic markers 
D13560 and D18561- Black lines represent positive 
hits. YACs are not drawn to scale. 

35 FIGURE 15 shows (a) a CAG repeat (in bold) and 
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surrounding nucleotide sequence isolated from YAC 
961_h_9. The sequence in italics is derived from End 
Rescue of the fragmented YAC. (b) PCR primers that can 
be used to determine the extent of trinucleotide 
5 repeats in the sequence. 

FIGURE 16 shows (a) a CAG repeat (in bold} and 
surrounding nucleotide sequence isolated from YAC 
766_f_12* The sequence in italics is derived from End 
10 Rescue of the fragmented YAC. (b) PCR primers that can 
be used to determine the extent of trinucleotide 
repeats in the sequence. 



FIGURE 17 shows (a) a CAG repeat (in bold) and 
15 surrounding nucleotide sequence isolated from YAC 

766_f_12* The sequence in italics is derived from End 
Rescue of the fragmented YAC. (b) PCR primers that can 
be used to determine the extent of trinucleotide 
repeats in the sequence. 

20 

FIGURE 18 shows (a) a CTG repeat (in bold) and 
surrounding nucleotide sequence isolated from YAC 
907__e_l. The sequence in italics is derived from End 
Rescue of the fragmented YAC. (b) PCR primers that can 
25 be used to determine the extent of trinucleotide 
repeats in the sequence. 



30 



Experimental 1 

( a ) Family Data 



Clinical diagnoses in MAD31, a Belgian family with a 
BPII proband were described in detail in De bruyn et 
35 al 1996. in that study only the 15 family members who 
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were informative for linkage analysis were selected 
for additional genotyping. The different clinical 
diagnoses in the family were as follows: 
1 BFI, 2 BPIX, 2UP, 4 Major depressive disorder (MDD) , 
5 1 SAm and 1 SAd. 

The pedigree of the MAD31 family is shown in 
Figure 12. 

(b) Genotypinq of Family Members 

10 

All short tandem repeat (STR) genetic markers are di^ 
or tetranucleotide repeat polymorphisms. Information 
concerning the genetic markers used in this study was 
obtained from several sources on the internet: Genome 
15 DataBase (GDB, http: //gdbww. gdb.org/ } , GenBank 

(http://ww.ncbi.nlm.nih.gov/) , Cooperative Human 
Linkage Center (CHLC, http://www.chlc.org/), Eccles 
Institute of Human Genetics (EIHG, 
http://wvw.genetics.utah.edu/) and Genethon 

20 (http: //www. genethon - fr/) . Standard PCR was performed 
in a 25 fil volume containing 100 ng genomic DNA, 200 
mM of each dNTP, 1.25 mM MgCl 2 , 30 pmol of each 
primer and 0.2 units Goldstar DNA polymerase 
(Eurogentec) . One primer was end-labelled before PCR 

25 with [ gamma - 22 P ] ATP and T4 polynucleotide kinase. After 
an initial denaturation step at 94 'c for 2 min, 27 
cycles were performed at 94 "C for 1 min, at the 
appropriate annealing temperature for 1.5 min and 
extension at 72 *C for 2 min. Finally, an additional 

30 elongation step was performed at 72' C for 5 min. PCR 
products were detected by electrophoresis on a 6% 
denaturing polyacry lamide gel and by exposure to an x- 
ray sensitive film. Successfully analysed STSs, STRs 
and ESTs covering the refined candidate region are 

35 fully described herein on pages 36 to 54. 
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(c) Lod score analysis. 

Two-point lod scores were calculated for 3 
different disease models using Fastlink 2.2. 
5 (Cottingham et al. 1993)- For all models, a disease 
gene frequency of 1% and a phenocopy rate of i/iooo 
was used. Model 1 included all patients and unaffected 
individuals with the latter individuals being assigned 
to a disease penetrance class depending on their age 

10 at examination. The 9 age^dependent penetrance classes 
as described by De bruyn et al (1996) were multiplied 
by a factor 0,7 corresponding to a reduction of the 
maximal penetrance of 99% to 70% for individuals older 
than €0 years (Ott 1991} . Model 2 is similar to model 

15 l, but patients were assigned a diagnostic stability 
score, calculated based on clinical data such as the 
number of episodes , the number of symptoms during the 
worst episode and history of treatment (Rice et al. 
1937, De bruyn et al. 1996). Model 3 is as model 1 but 

20 includes only patients, 

< d ) Construction of the YAC contig - protocols 

Growing of YACs and extraction of YAC DNA was 
25 done according to standard protocols (Silverman, 

1995) . For the construction of the YAOcontig spanning 
the chromosome lag candidate region, the data of the 
physical map based on sequence tagged sites (STSs) 
(Hudson et al. 1995) was consulted on the Whitehead 
30 Institute (WI) Internet site (http: //www- 
genome, wi.mit.edu/ ) . CEPH mega-YACs were obtained from 
the YAC Screening Centre Leiden (YSCL, the 
Netherlands) and from CEPH (Paris, France). The YACs 
were analyzed for the presence of STSs and STRs, 
35 previously located between Disssi and D18S61, by 
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touchdown PCR amplification. Information on the 
STSs/STRs was obtained from the WI, GDB, Genethon, 
CHLC and GenBank sites on the Internet ♦ Thirty PCR 
cycles consisted of: denaturation at 94 *c for i min, 
5 annealing (2 cycles for each temperature) starting 
from 65 *c and decreasing to 51 *C for 1.5 min and 
extension at 72 *C for 2 min. This was followed by 10 
cycles of denaturation at 94 *c for 1 min, annealing at 
50 'C for 1.5 min and extension at 72 *C for 2 min- A 
10 final extension step was performed for 10 min at 72 *C. 
Amplified products were visualised by electrophoresis 
on a 1% TBE agarose gel and ethidium bromide staining. 

Ordering of the STR markers > 

15 

Twelve STR markers, previously located between 
D18S51 and D18S61, were tested for cosegregation with 
bipolar disease in family MAD31 . The parental 
haplotypes were reconstructed from genotype 

20 information of the siblings in family MAD31 and 

minimalizing the number of possible recombinants. The 
result of this analysis is shown in Figure 13. The 
father was not informative for 3 markers, the mother 
was not informative for 5 markers. Haplotypes in 

25 family MAD31 suggested the following order for the 
STR markers analysed: cen«[S5l-S68~S346]-[S55-S969- 
S1113-S483-S465]-£S876-S477]-S979~[S4 66-S817-S61]~tel. 
The order relative to each other of the markers 
between brackets could not be inferred from our 

30 haplotype data. The marker order in family MAD31 was 
compared with the marker order obtained using 
different mapping techniques and the results shown in 
Table l below, 
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Table 1. Comparison of the order of the markers within [he 18q candidate region for bipolar 



disorder, among several maps. 



5 


Marker* 


Genethon 


Genetic maps 

Marsnnela 


Radiation hybrid map 
(uiacaione et ai. iy?o; 






D18S51 




(-)3.4cM 


(-)27.9 cR 




10 


D18S68 
D18S346 


OcM 


OcM 
5.3 cM 


OcR 

52.2 cR 






D18S55 


0.1 cM 


OcM 


72.5 cR 




X -3 


D18S969 
D18S1U3 


0.7 cM 


0.6 cM 








D18S483 


2.5 cM 


3.2 cM 


88 cR 




£ U 


DI8S465 


4.5 cM 


5.3 cM 


101.3 cR 






Dlooo/O 












DI8S477 


4.4 cM 


5.3 cM 


166.4 cR 




25 


D18S979 




8.9 cM 








DI8S466 


7.6 cM 


11.1 cM 


212.4 cR 






DI8S61 


8.4 cM 


11.8 cM 


249.5 cR 




30 


DI8S817 




5.3 cM 


260.6 cR 






* Order according 


to haplotyping results in family MAD3 


1. 






{-) Marker is located proximal of D18S68. 
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D18S68, common to all 3 maps, was taken as the 
map anchor point, and the genetic distance in cM or cR 
of the other markers relative to D18S6S are given. The 
marker order is in good agreement with the order of 
5 the markers on the recently published chromosome 16 

radiation hybrid map (Giacalone et al. (1996) Genomics 
37:9-18 ) and the WI YAC-contig map (http :/ /www- 
genome .wi.mit.edu/ ) • However/ a few discrepancies with 
other maps were observed. The only discrepancy with 
10 the Genethon genetic map is the reversed order of 

018S465 and D18S477. Two discrepancies were observed 
with the Marshfield map 

(http://www.marshmed.org/genetics/). The present 
inventors mapped D18S346 above D18S55 based on 

15 maternal haplotypes, but on the Marshfield maps 

D1SS346 is located between D18S483 and D18S979. The 
inventors also placed D18S817 below D18S979, but on 
the Marshfield map this marker is located between 
D18S465 and D18S477. However, the location of D18S346 

20 and D18S817 is in agreement with the chromosome 18 

radiation hybrid map of Giacalone et al. (1996) . One 
discrepancy was also observed with the WI radiation 
hybrid map (http://wwv-genome.wi.mit.edu/), in which 
D18S68 was located below D18S465. However, the 

25 inventors as well as other maps placed this marker 
above D18S55. 

(f ) Lod score analysis and refinement of the 
candidate region. 

30 

Lod score analysis gave positive results with all 
markers, confirming the previous observation that 
18q2i.33-q23 is implicated in BP disease, at least in 
family MAD31 (De bruyn et al- 1996)* Summary 
35 statistics of the lod score analysis under all models 
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are given in table 2 below. 
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The highest two-point lod score (+2.01 at 8=0.0} 
was obtained with markers D18S1113, D18S876 and 
D18S477 under model 1 in the absence of recombinants 
(table 2). In model 1, all individuals with a BP 
5 spectrum disorder are considered affected and fully 
contributing to the linkage analysis. 
Before the fine mapping the candidate region was 
flanked by D18S51 and D13S61, which are separated by a 
genetic distance of 15.2 cM on the Marshfield map or 

10 13.1 cM on the Gen£thon map. The informative 

recombinants with D18S51 and D18S61 were observed in 2 
affected individuals (II. 10 and ll.ll in Fig. 13). 
However , since no other markers were tested within the 
candidate region it was not known whether these 

15 individuals actually shared a region identical-by- 
descent (IBD) * The additional genetic mapping data now 
indicate that all affected individuals are sharing 
alleles at D18S969, D18S1113, D18S876 and D18S477 
(Fig. 13, boxed haplotype) . Also, alleles from markers 

20 D18S483 and D18S465 are probably IBD, but these 

markers were not informative in the affected parent 
1.1* Obligate recombinants were observed with the STR 
markers D13S68, D18S346, D18S979 and D18S817 (Table 2, 
fig* 13} Since discrepancies between different maps 
25 were observed for the locations of D18S346 and 

D18S817, the present inventors used D18S63 and D18S979 
to redefine the candidate region for BP disease. The 
genetic distance between these 2 markers is 8.9 cM 
based on the Marshfield genetic map 
30 (http. //wvv.marshmed*org/genetics/) , 

(g) Construction of the YAC contia. 

According to the Wl integrated map 56 CEPH 
35 megaYACs are located in the initial candidate region 
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contained between D18S52 and D1SS61 (Chumakov et al. 
{1995} Nature 377 Suppl., De bruyn et al. (1996)}- 
From these YACs, those were selected that were located 
in the region between D1SS60 and D18S61. D18S51 is not 
5 presented on the WI map, but is located close to 
D18S60 according to the Marshfield genetic map 
(http.// www, marshmed.org/genetics/) . To limit the 
number of potential chimaeric YACs, YACs were 
eliminated that were also positive for non-chromosome 
10 18 STSs. As such, 25 YACs were selected (see Figure 

14), and placed in a contig based on the technique of 
YAC contig mapping, i,e. sequences from sequence 
tagged sites (STSs), simple tandem repeats (STRs) and 
expressed sequenced tags (ESTs) , known to map between 
15 D18S60 and D18S61, were amplified by PGR on the DNA 
from the YAC clones* The STS, STR and EST sequences 
used, are described from page 36 to 54. Positive YAC 
clones were assembled in a YAC contig map (Figure 14} . 

Three gaps remained in the YAC contig, of 
20 which one, between D18S876 and GCT3G01, was located in 
the refined candidate region. To close the gap 
between D18S876 and GCT3G01, 14 YAC clones (Table 3, 
on page 62} were further analysed. End fragments from 
YAC clones 766. f. 12 (SV11R) , 752„g v 8 (SV31L) , 942. c_3 
2 5 (SV10R) were obtained and sequenced (see pages 55-61) , 
Primers from these three sequences were selected, and 
DNA of each of the 14 YAC clones was amplified by PCR, 
As indicated in Table 3, overlaps were obtained 
between 7 YAC clones on the centromeric side, and two 
30 YAC clones on the telomeric site (717_d.3 and 907.e.l) . 

The final YAC contig is shown in Figure 14. 
In the figure, only the YAC clones which rendered 
unambiguous hits with the chromosome 18 STSs, STRs and 
ESTs are shown. In a few cases, weak positive signals 
35 were also obtained with some of the YAC clones, which 
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likely represent false positive results* However, 
these signals did not influence the alignment of the 
YAC clones in the contig* Although, all YACs known to 
map in the region were tested as well as all available 
5 STSs/STRs, initially, the gap in the YAC contig was 
not closed. However, this was subsequently achieved 
by determining the end-sequences of the eight selected 
YACs (see below) « The order of the markers provided by 
the YAC contig map is in complete agreement with the 
10 marker order provided by the WI map which integrates 

information from the genetic map, the radiation hybrid 
map and the STS YAC contig map (Hudson et al, 1995). 
Also, the YAC contig map confirms the order of the STR 
markers as suggested by the haplotype analysis in 
15 family MAD31* Moreover, the YAC contig map provides 
additional information on the relative order of the 
STR markers, For example, D18S55 is present in YAC 
931_g_10 but not in 931_f_l (Fig. 14), separating 
D18S55 from its cluster [355-S969-S1113-S483-S465] 
20 obtained by haplotype analysis in family MAD31. The 
centromeric location of D18S55 is defined by the 
STS/STR content of surrounding YACs (Fig. 14). If we 
combine the haplotype data and the YAC contig map the 
following order of STR markers is obtained: cen- [$51- 
25 S68-S34 6]-S55-[S969-S1113] - [S4S3-S4S5 ] -S876-S4 77-S9 79- 
S46S- [ S817-S61 ] -tel . 

Out of the 25 YAC clones spanning the whole 
contig, seven YAC clones were selected in order to 
identify the minimal tiling path (Table 4). These 7 
30 YAC clones cover the whole refined chromosome 18 

region. Furthermore , YAC clones should preferably be 
non-chimeric, i.e. they should only contain fragments 
from human chromosome 18. In order to examine for the 
presence of chimerism, both ends of these YACs were 
35 subcloned and sequenced (pages 55 to 61) . For each of 
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the sequences, primers were obtained, and DNA from a 
monochromosomal mapping panel was amplified by PCR 
using these primers. As indicated on pages 55 to 61, 
some of the YAC clones contained fragments from other 
5 chromosomes, apart from human chromosome 18. 

Three YAC clones were then selected 
comprising the minimum' tiling path (Table 5) . These 
three YAC clones were stable as determined by pulsed 
field gel electrophoresis and their seizes correspond 
10 well to the published sizes. These YAC clones were 
transferred to other host yeast strains for 
restriction mapping, and are the subject to further 
subcloning. 



15 



35 
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Description of the succesfuily analysed STSs, STRs and ESTs 
covering the refined candidate region. 

Explanations: 

• STS: Sequence Tagged Site 

• SIR; Simple Tandem Repeat 

• EST: Expressed Sequence Tag 

These markers are ordered from the centromere to the telomere. Only the 
markers that were effectively tested and that worked on the YACs are given- 



List: 

1. D18S60; 

Database ID: AFM178XE3 (Also known as 178xe3, Z16781, D18S60) 
Source: J Weissenbach, Genethon: genetically mapped pofymorphySTSs 
Chromosome: Chr18 

Primers: 

Left = CCTGGCTCACCTGGCA 

Right = TTGT AG CATC G TTGTAATG TTC C 

Product Length = 157 

Review complete sequence: 
AGCTA TCCTGGCTCACCTGGCAA AAATACAGTGTATACACACACACACAC 
ACACACACACACACACACAGAGTGTNTTANTNATTCCAGCAAATAATATTA 
CATATAAAAGATCTAATTGGTTCATCATGTAAATTTAGT AGGAACATTACA 
ACGATGCTACAA GANTTTATCCAAAACTGAGATTTCCTTAGAATATCTGTT 
AAMGTAATmArrCAGTTMTAGAMTTCTAnGAAMCATCAMCTTAT 
AAAGCT 

GenbankID: Z16781 

Description: H. sapiens (D18S60) DNA segment containing (CA) repeat; 
clone 

Search for GOB entry 
2. Wl-9222 

Database ID, UTR-03540 (Also known as G06101, D18S1Q33, 9222, 
X53657) 

Source: VVICGR: PnV^srs derived from Genbank sequences 
Chromosome. ChrlS 

Primers 
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Left = GATCCCATAAAGCTACGAGGG 

Right = GAGTCTAAAGACAAGAAAGCATTGC 

Product Length = 99 

Review complete sequence: 
TCTTCTTACCCCTTGGAAGAAGACTGTTTCCAAATAATTTGAACAGCTTG 
CTGCTAAATGGGACCCAATTTTTGGCCTATAGACACTTATGTATTGTTTTC 
GAATACGTCAGATTGGACCAGTGCTCTTCAGGAATGTGGCTGCAAGCAA 
GGGGCTAGAAGTTCACCTCCTGACAGTATTATTAATACTATGCAAATATG 
GAATAGGAGACCATTTGATTTTCTAGGCTTTGTGGTAGAGAGGTGAAGG 
TATGAGAATTAATAGCGTGTGAACAAAGTAAAGAACAGGATTCCAGAATG 
ATCATTAAATTTGTTTCTATTTATTCTTTTTTGCCCCCCTAGAGATTAAGTC 
CAGAAATGTACTTTCTGGCACATAAAGAAATCTTGAGGACTTTGTTTAAAC 
CTTCCATAAAAAAACAATTTTCGGTTTCTCGGGTNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNT 
TCTTTCTTTGTGTATTTTATTCAAGATGAGTTGGACCCATTGCCAGTGAGT 
CTGAATGTCACTGACAGCCCTGTGTTGTGCTCAGGACTCACTCTGCTGC 
TGGTGGAAACTCATGGCTTCTCTCTCTCTT TGATCCCATAAAGCTACGAG 
SSGGACGGGAGAGGGCAGTGCMTGGGAAGTAMGAGATATTTTCCAG 
TAGGAAA AGCAATGCTTTCTTGTCTTTAGACTCA AATGCTTAGGGAACGT 
TTCATTTCTCATTCATGGGGAAAGGCAGCCTCCTTAAATGTTTTCTGMG 
AGCGGTAAAATCTAGAAGCTTAAGAATTTACAGTTCCTTCAATAACCATGA 
TGACCTGAAGTTCACCTATCCCATTTTAGCATCTACTTGTTTTTCCCATCT 
CTTCCTTTCCAATTTTGCTTATACTGCT GTMT ATTTTTGTN NNNNN NNNN 
NNNNNNNNNNNNNNNGACCAGCTAAMTTTTCGACTTGACTTTTTAACTT 
AACTCATGAATTAATTAAAGCAAATGAAAAAATTAAAAAGTGTGAC I I I I J 
CTCGGAGCATATATGTAGCTTTTAGGAAAGGCTGATGATGGTAT AAAGT T 
TGCTCATTMGAAAAAAAGACAAGGCTGATTTTGAAGAGAGTTGCTTTTG 
AAATAAAATGATCA 

Genbank ID; X63657 
Description: H. sapiens fvtl mRNA 
Search for GDB entry 

3. WI-7336: 

Database ID: UTR-04664 (Also known as PIS. GQO-679-135, G06527. 7336. 
U04313) 

Source: WICGR: Primers derived from Genbank sequences 
Chromosome: Chrl 3 

Primers' 

Left = AGACATTCTCGCTTCCCTGA 
Right = AATTTTGACCCCTTATGGGC 
Product Length = 332 
Review complete sequence: 

TAAGTGGCATAGCCCATGTTAAGTCCTCCCTGACTTTTCTGTGGATGCCG 

ATTTCTGTAAACTCTGCATCCAGAGATTCATTTTCTAGATACAATAAATTG 

CTAATGTTGCTGGATCAGGAAGCCGCCAGTACTTGTCATATGTAGCCTTC 
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ACACAGATAGACCNNNNNNNNNNNCCAATTCTATCTTTTGTTTCCTTTTTT 
CCCATAAGACAATGACATACGCTTTTAATGAAAAGGAATCACGTTAGAGG 

aaamtatttattcattatttgtcaaattgtccggggtagttggcagaaat 

acagtcttccacaaagaaaattcctataaggaagatttggaagctcttct 

tcccagcactatgctttccttctttgggatagagaatgttc cagacattc 

tcgcttccctgaa agactgaagaaagtgtagtgcatgggacccacgaaa 

ctgccctggctccagtgaaacttgggcacatgctcaggctactataggt 

ccagaagtccttatgttaagccctggcaggcaggtgtttattaaaattct 

gmttttggggattttcaaaagataatattttacatacactgtatgttata 

gaacttcatggatcagatctggggcagcaacctataaatcaacaccttaa 

TATGCTGCAACAAAATGTAGAATATTCAGACAAAATGGATACATAAAGACT 

AAGT AGCCCATAAGGGGTCAAAATT TGCTGCCAAATGCGTATGCCACCA 

ACTTACAAAAACACTTCGTTCGCAGAGCTTTTCAGATTGTGGAATGTTGG 

ATAAGGAATTATAGACCTCTAGTAGCTGAAATGCAAGACCCCAAGAGGAA 

GTTCAGATCTTAATATAAATTCACTTTCATTTrrGATAGCTGTCCCATCTG 

GTCATGTGGTTGGCACTAGACTGGTGGCAGGGGCTTCTAGCTGACTCG 

CACAGGGATTCTCACAATAGCCGATATCAGAATTTGTGTTGAAGGAACTT 

GTCTCTTCATCTAATATGATAGCGGGAAAAGGAGAGGAAACTACTGCCTT 

TAGAAAATATAAGTAAAGTGATTAAAGTGCTCACGTTACCTTGACACATAG 

TTTTTCAGTCTATGGGTTTAGTTACTTTAGATGGCAAGCATGTAACTTATA 

TTAATAGTAATTTGTAAAGTTGGGTGGATAAGCTATCCCTGTTGCCGGTT 

CATGGATTACTTCTCTATAAAAAATATATATTTACCAAAAAATTTTGTGACA 

TTCCTTCTCCCATCTCTTCCTTGACATGCATTGTAAATAGGTTCTTC7TGT 

TC TG AG ATTC AATATTG AATTT C TC C TATG C T ATTG AC AATAAAATATTATT 

GAACTACC 

Genbank ID: G06527 

Description: WiCGR- Random genome wide STSs 
4. WI-8145- 

Oatabase ID: EST1 02441 (Also known as D18S1234, G00-677-827, G06845, 
8145, T49159) 

Source: WICGR: STSs derived from dbEST sequences 
Chromosome: Chr13 

Primers: 

Left = GAAATGCACATAACATATATTTGCC 
Right = TGCTCACTGCCTATTTAATGTAGC 
Product Length = 1 34 
Review complete sequence: 

GTTGTTTGGANGCAGGTTTATTTATTATATACTTGCAATTGAATATAAGAT 

ACAGACATATATATGTGTTATGTATTTCT AGAAATGCACATAACATATATTT 

GCCTATT GTTTAATG TTTTTTC C AG AN ATTTATTAC AG AAGGG C ATG GAG 

GGATACCTACTTATTCTTCATTATGAGAACAATTAAAGGCATTTATTAGAT 

A G G AAATTAAC AG A M f: ATC TG C TTC TATAA C TTTATT AG C TAC ATTAAATA 

GGCAGTGAGCA NTAATTTAAAANCTCACCATTATATAAANTANTAAATACC 

AAAGTAAAAG 
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left and fight primer 



PCR Conditions 
Genbank ID: T49159 

Description: yb09e07.s1 Homo sapiens cDNA clone 70692 3' similar to 
gb:J02685 

UniGene Cluster Description: Human mRNA for Arg-Serpin (plasminogen 
activator-inhibitor 2, PAI-2) Search for GDB entry 



5, WI-7061: 

Database ID: UTR-02902 (Also known as PAI2, G00-678-979, G06377, 7061, 
M18082) 

Source: WICGR: Primers derived from Genbank sequences 
Chromosome: Chr18 

Primers: 

Left = TGCTCTTCTGAACAACTTCTGC 
Right = ATAGAAGGGCATGGAGGGAT 
Product Length = 338 
Review complete sequence: 

AACTAAGCGTGCTGCTTCTGCAAAAGATTTTTGTAGATGAGCTGTGTGCC 

TCAGMTTGCTAmCAMTTGCCAAAAATTTAGAGATGTTTTCTACATAT 

TTC TGCTCTTCTGAACAACTTCTGC TACCCACTAAATAAAAACACAGAAAT 

AATTAGAC AATTGTC TATTATAAC ATG AC AAC C CTATTAATCATTTG GTCT 

TCTAAAATGGGATCATGCCCATTTAGATTTTCCTTACTATCAG7TTATTTT 

T ATAAC A7TAA C TTTT A C TTTG TTATTTATTATTTTATA TAATG G TG A GTTTT 

AAATTATTGCTCACTGCCTATTTAATGTAGCTAATAAAGTTATAGAAGCAG 

ATGATCTGTTAATTTCCTATCTAATAAATGCCTTTAATTGTTCTCATAATGA 

AGAATAAGTAGGT ATCCCTCCATGCCCTTCTAT AATAAATATCTGGAAAAA 

ACATTAAACAATAG G C AAATATATGTTATGTG CATTTCTAGAAATAC ATAA 

CACATATATATGTCTGTATCTTATATTCAATTGCAAGTATATAATAAATAAA 

CCTGCTTCCAAACAACNNNNNNNNNNNNNNGGAATTC 

PCR Conditions 

Genbank ID: G06377 

Description: WICGR Random genome wide STSs 



6. D18S68 



Database ID. AFM2^5YB9 (Also known as 248yb9, Z17122, D18SS8) 
Source: J Weissenbach, Genethon. genetically mapped polymorphic STSs 
Chromosome Chrlc 
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Primers: 

Left = ATGGGAGACGTAATACACCC 
Right = ATGCTGCTGGTCTGAGG 
Product Length = 285 
Review complete sequence: 

AAAGAGTTGGGGTTGTGAATTCCCACACCAGTCAACTATTGGCTATGGG 
CTTACC ATGGGAGACGTAATACACCC GGNACTTCCAACTCACATACCAG 
AGACATGGCTCTAGCACCCAATGGAAATATGCTGAATGTTGCAGGTGCA 
AGACAGCAACAAAGCAGACAGAGGCACATAGACAAGGCACCAACAGTGT 
CCACTATACCCTGACAGTGTGGAAAGTTGTAGATAGGATGAAGAGAAAG 
AATACACACACACACACACACAGACACACACACACACACACACACACACA 
CGGTAGANACTTACTAC NCAAAGTGTGA NCCTCAGACCAGCAGCAT CTG 
GC N AAATG GTG ATC TATC AC CTTC C AG 

Genbank ID: 217122 

Description: H. sapiens (D1SS68) DNA segment containing (CA) repeat; 
clone 

7. WI-3170: 

Database ID: MR3726 (Also known as D18S1037, G04207, HALd22f2, 3170) 
Source: WICGR; Random genome wide STSs 
Chromosome: Chr18 

Primers: 

Left = TGTGCTACTGATTAAGGTAAAGGC 
Right = TGCTTCTTCAATTTGTAGAGTTGG 
Product Length = 156 
Review complete sequence 

CTGAGACAAGGCAGGCAAACAACCTCTAAAAATCTACAATTGGTGATTGG 
TGTGCTACTGATTAAGGTAAAGGC ACAGAATTATACATCCAGGTTNCTAT 
TACTTATG G C AG AC TC AGG AC C C AGG TTNAG AG ACC ACTG G C CTTAAGA 
AAAAAAATGGGGTTCCTGATTTCTGGATAATAAT CCAACTCTACAAATTGA 
AGAAGCAAC ATACC CTCTTTGTTA 

Genbank ID: G04207 

Description: WICGR: Random genome wide STSs 



8 Wl-5654: 



Database ID. MR1CSC8 (AJso known as D18S1259, G00-678-595, G05278, 
5654) 

Source: WICGR Rsrdcm genome wide STSs 
Chromosome. Cnric 
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Primers: 

Left = CTTAATGAAAAC AATGC CAGAG C 
Right = TGCAAAATGTGGAATAATCTGG 
Product Length = 149 
Review complete sequence: 

CTACAAAATGCATGTGGCTTTGGCTTTGAAATAGTACACCCTATCAAAGA 
CTAAATTT TCTTAATGAAAACAATGCCAGAGC 1 1 1 I I I CATGATATTTTGTT 
TTTAGAGATGGGGAACAATCTGGACGTTGTTTCCTTATCTGGGTGGTAAT 
CGAGGCTTAGCAATTTCCCACAGCGTTACACAAA TCCAGATTA TTCCACA 
TTTTGCAA ATA 

Genbank ID: G05278 

Description: WICGR: Random genome wide STSs 
9. 01 8555: 

Database ID: AFM122XC1 (Aiso known as 122xc1, Z16621. D18S55, 
GC378-D18S55) 

Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 

Chromosome: Chf18 

Primers: 

Left = GGGAAGTCAAATGCAAAATC 

Right = AGCTTCTGAGTAATCTTATGCTGTG 

Product Length = 143 

Review complete sequence: 
AGCTGMCATGCCTTTTCATGGAGCAGTTTCNAAATACACTTTTGGTACA 
ATCTGCAGGTGGATATTTGGAGCTCAGGAGTTTGAGACCAGCCTGGGCA 
ACATGGTGAAATCCCGTCTCTACTAAAATACAAAAAATTAGCCAGGTGTG 
G C GGCATGTG CCTGTAGNCC C AG G ATG GATTG AGTG G GTG AGATATG G 
AATAAGTGG TGGGAAGTCAAATGCAAAATCA ATTCAGTTTGTCAATATTG 
ATTCTCTATTCTAGCCTGGCGTGGTTTTTCCTCGTCACACACACACACAC 
ACACACACACACACACACACACACACACA CACAGCATAAGATTACTCAGA 
AGCT 

Genbank ID: Z16621 

Description: H. sapiens (D18S55) DNA segment containing (CA) repeat: 
clone 

10 D 185969- 

Database ID: GATA-P 1 8099 (Also known as G08003, CHLC.GATA69F01, 
CHLC. GAT A69F01.P 18099} 

Source: CHLC' gers'.ically mapped polymorphic tetranucleotide repeats 
Chromosome Chris 

Primers' 

Left = AACAAGTGTGTATGGGGGTG 
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Right = CATA7TCACCCAGTTTGTTGC 
Product Length - 365 
Review complete sequence: 

CAGGGAAATGCAAATCAAAACCACAATGAGTTATCTCCTCATACCTTTAAT 

GATGGCTAATATTAAACAAGAGA TAACAAGTGTGTATGGGGGTG TGGAG 

AAAAGAGAATGTNCGAACACTCTTGGTTGAAATATAAGTTGGTAGANCCA 

TTATGCAAAACAGTATGAATCTTTATCAGTATAANATTAGGACCTNGCATA 

TGATCNCAGCAATCNCCACNTCTGNGNGATCNCACNCNCTATCTCTCTAT 

ATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCT 

ATCTGTCTGTCTATCATCTATCTATCTTCTATCTATCTATCTATCTTTCTAT 

CTATCTATCTGTCTATCTATNCCGGAATATTTTTCAGCCATNNAAATAAGG 

AAGTC C TGC TATT TG C AAC AAACTGGGTGAATATG GAGAACGTTATGCTA 

AATGCAATATGCTAAAGACAGACACAGAAAGACAAGTATGACCTCACTTA 

TATGTGGAAACTGAAAAAGCCATACTCATTACAGCAMGAGTAGAATGTT 

GGTTACCAGGGGCAAAGAGGGTAGAAATGAGGGGAGTGAGAAAATGTC 

AATCAAAGTGTAAGAATGTTATAACATAAATAAATTCATAGAG 

Genbank ID; G08003 

Description: human STS CHLC.GATA69F01.P18099 done GATA69F01. 



11. D18S1113: 

Oatabase ID: AFM2C0VG9 (Also known as D18S1 1 13, 200vg9, w2403) 
Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome: Chr18 

Primers: 

Left = GTTGACTCAAGTCCAAACCTG 

Right = CAAAGACATTGTAGACGTTCTCTG 

Product Length = 207 

Review complete sequence: 
AGCTGCATATAAMCTATTCCATTTCACATTTTTGAAGACATTTGTAGCCA 
TGATACTTTGCTGTTGTCTGTGGGCCACCTCTTTTTGAAGTGTGTAGTTA 
ACTGTGCTCC TGTAATC TGTTGTC TGTTGACTCAAGTCCAAAC CTG TTCT 
GCGTGGCATGTTTCTNCAACTTGATGTGATGCTATTTATCACTTTCT7TGA 
AGTTAAGTCTCTATGTCTTTGTATTCTTTCTGTGTACCCAGGGATATGTTT 
GTGCATGCACACGCATAAACACACACACACACACACACACACACAGAGA 
CAGAGACAGAGAACGTCTACAATGTCTTTGTGAG 



12. D18S868: 

Database ID: GATA-DT3S868 (Also known as G09150, CHLC GATA3E12, 
CHLC.GATA3E12.49S CHLC.496, D18SS68) 
Source- CHLC' gene:ca.'!y mapped polymorphic tetranucleotide repeats 
Chromosome. Chrl6 
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Primers: 

Left = AGCCMTACCTTGTAGTAAATATCC 

Right = GATTCTCCAGACAAATAATCCC 

Product Length = 189 

Review complete sequence: 
GAGTG AGCCAATACCTTGTAGTAAATATCCA TCTATCTTTGATGTATCTAT 
GTATCTATCTTTGTATCTATATGTCTATGTATCTATGTATGTATGTATCTAT 
CTATCATCTATCTATCTATCATCTATCTATCTATCTATCTATCTATCTATCT 
ATCTATCTATATCCNTT TGGGATTATTTGTCTGGAGAATC CTGATTAACAT 
AGTCTGCTAACTTTTATCTGTATCTCCTATGTGTATGCTTCTCCTTCTTCC 
TGTCTC TCTCTCTTCTTTGTC CTC ATTTAAN CTC CTTTCCTG GG N ATATTG 
GNAATTTTGATTGGANTCTGGACANTGTAGGAGTAAAAATTT 

Genbank ID: G09150 

Description; human STS C H LC . GATA3E 1 2. P6553 clone GATA3E12. 



13. WI-9959: 

Database ID: MR12816 {Also known as D18S1251, G00-678-524, G05488, 
9959} 

Source: WICGR: Random genome wide STSs 
Chromosome: Chrl8 

Primers: 

Left = TGCCAACAGCAGTCAAGC 

Right = AGCACCTGCAGCAGTAATAGC 

Product Lengths 110 

Review complete sequence: 
ctgttttatttqaaaaaaaaaatctqtctccaaqaagaaaaqttcattctACCTGT TGCCAACAGC 
AGTCAAGC GGACATGTTTAAAATTTTTTAAAAAAGTAl I I'll I l'l ICCAACT 
GG NGTTT MTAGCCTCATTTTGGCTTTT GCTATTACTGCTGCAGGTGCT T 
TN ATTTTTTTC C TC TG C ATTATAATTA C 

Genbank ID: G05483 

Description: WICGR: Random genome wide STSs 
Search for GDB entry 

14, D18S537: 

Database ID. CHLC GATA2E06.13 (Also known as CHLC.13, GATA2E06, 
D18S537, GATA-D18S537) 

Source: CHLC: genetically mapped polymorphic tetranucleotide repeats 
Chromosome- ChrTS 

Primers: 

Left = TCC ATC TATC TTTGATGTATC TATG 
Right = AGTTAGCAGACTATGTTAATCAGGA 
Product Length = i3i 
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Review complete sequence: 
AAAGCTGAGTGAGCCAATACCTTGTAGTAAAT ATCCATCTATCTTTGATGT 
ATCIATGTATCTATCTTTGTATCTATATGTCTATGTATCTATGTATGTATGT 
ATCTATCTATCATCTATCTATCTATCATCTATCTATCTATCTATCTATCTAT 
CTATCTATCTATCTATATCCNTTNGGTATTATTNGTCTGGNGA ATCCTGAT 
TAACATAGTCTGCTAACT TNTATCTGTATCTNCTATGTGTATGCTTCTNCT 
TCTTCCTGTCTCTCTCTCTGCTTTGTCCTCAATTNAAATCTCC 

Genbank ID: G07990 

Description: human STS CHLC.GATA2E06.P6006 clone GATA2E06. 
Search for GOB entry 

15. D18S483: 



Database ID: AFM324WC9 (Also known as 324wc9, 224399, D18S483) 
Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome: Chrl8 

Primers: 

Left * TTCTGCACAATTTCAATAGATTC 

Right = GAACTGAGCAAACGAGTATGA 

Product Length = 214 

Review complete sequence: 
AGCTCTGCTGGAAGAGCAGGGCTGTT TTCTGCACAATTTCAATAGATTC C 
CCTACCCTGGGTTTTTCAGTAGATAGATAGATAGATGATAGATAGGTAGA 
TAGATAGATAGATAGATAGATAGATAGATAGATAGATGATAGATAGATTTT 
ATATATAGTATATAAAATCTACACACACACACACACACACACACACACATA 
TTTGCCTTTCCTTGACT ATCATACTCGTTTGCTCAGTTC U I I I I 1 1 I I IAA 
ATTTTTGTTTGTAAATCCAAAATGCTT 

Genbank ID: Z2439S 

Description: H. sapiens (D18S483) DNA segment containing (CA) repeat; 
clone 

Search for GDB entry 
16. D18S465: 

Database ID" AFM2SGYH1 (Also known as 260yh1. Z23850, D18S465) 
Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome: Chr18 

Primers: 

Left = ATATTC CC C TATG G AAGTAC AG 

Right = AAAGTTAATTTTCAGGCACTCT 

Product Length = 232 

Review complef? sequence, 
AGCTCTGTCCCTCTAGAGAACGCTGACTAAT ATATTCCCCTATGGAAGTA 
CAGATGGTTTTTNTAAAATAAATTTATCTGATTGTGATGAGATAATCATCA 
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TTTTTATGTTCAGTGTTTTTCTAAA f I I I I I ATTGTTATTG I 1 I TTATACTCT 

AAATGG I 1 I I 1A AATATGCACATATGTGCATATTTTACACACACACACACA 

CACACACACACTCTCTTTATTTAGAAGCATTATAGA TAGAGTGCCTGAAAA 

TTAACTTT TAACCNAAGAAAAGACAATAAGGAACAATAGGGAAGTTATCC 

TTTGCTAAGGGTATGGAAAATATTCACATATTATTTATAACANGTTAAACC 

AAGTCATGCTTGANTATAATAGCT 

Genbank iD: 223350 

Description: H. sapiens (018S465) DNA segment containing (CA) repeat; 
clone 

Search for GDB entry 



17, D18S968: 

Database ID: GATA-P34272 (Also known as G10262, CHLC.GATA1 17C05. 
CHLC.GATA117C05.P34272) 

Source: CHLC: genetically mapped polymorphic tstranucleotide repeats 
Chromosome: Chr18 

Primers: 

Left = GAAATTAACCAGACACTCCTAACC 

Right = CTTAGAATTGCCTTTGCTGC 

Product Length = 147 

Review complete sequence: 
GAATAAAAATATGAGGTATTAGAAATTTACAGATAGGAA GAAATTAACCAG 
AC ACTC CT AAC C AC C GATNAGTTTAAAGAGG AG ATAG ATAG ATA GATG AT 
AGATAGATAGATAGATAGATAGATACCACTGAAAATGCAANCACAAATTA 
GCAGATTATATGTGA TGCAGCAAAGGCAATTCTAAG TAGATTCTAACTGC 
TAC ATTGATAG C AGTAC CC A CTG ACATTAC C GGAAAG GATG GTATC C ATA 
ACCACCTACCTATATACCTCCGCAGCTGGANATTAGGNTTAAGCTTCTTN 
GGGCNCCTGGCGGCCCCNNTTGTGGTCCCCGGTNGGNCCCCGNTTNN 
GNNTNGCTNNGNTTNCNTTGGNGNCCCCCNNTNGGTTTNNGGNNNNNT 
NNNNNTNGNNNNNTTNCCCNNNNNNNNTNTNNNNCNNNNNNNNNTNNN 
NNNNNNNNNNGGNNNNNGGGN 

Genbank ID: G 10262 

Description: human STS CHLC.GATA117C05.P34272 clone GATA117C05. 



18 GATA-P605T 

Database ID' GATA-P6051 (Also known as CHLC.GATA3E08, 
CHLC. GAT A3E08. P505 1 ) 

Source: CHLC: genetically mapped polymorphic tetranucleotide repeats 
Chromosome: Chr13 

Primers' 

Left = GCAACAACCCTAATGAGTATACG 
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Right - GAGTCTCACCAGGGCTTACA 

Product Length = 149 

Review complete sequence: 
AAAGCTGTCTCCTTTTGTAMGTGTGCTCAGAGGMTCTTTTTCAGTAAAT 
AAAGTCTGCACCCAGACATCTCACTTTGTATACCACGGAGAATTTACCAT 
GACTCTTCTCAGTGATAAACGTCAATATAGAATAATCAGGAGAAAAAGAG 
AAATCCAGTAAAGAAATAAGTCTGTAGAA AGCAACAACCCTAATGA6TAT 
A£gATATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATG 
NATCTATCTATCTAACATTATATAAAATATATATTTCCTCCTGTATTGGGG 
CCCTGT GTGTAAGCCCTGGTGAGACTCA AAAATTTGANTATTCCTNTTTN 
T 

Genbank ID: G09104 

Description: human STS CHLC.GATA3E08.P6051 done GATA3E08. 
19. D18S875: 

Database ID: GATA-D18S875 (Also known as G08001, CHLC.GATA52H04, 
D18S875) 

Source: CHLC: genetically mapped polymorphic tetranucleotide repeats 
Chromosome: Chr18 

Primers. 

Left = TCCTCTCATCTCGGATATGG 

Right = AAGGCTTTCAGACTTACACTGG 

Product Length = 394 

Review complete sequence: 
TTATTTATTCACTC ATTCAATAAATATTTATGAATTTC C TTTAATGGC NAN G 
AAAGTATGTTTGGTACTGAATATGGTGAGCAAGATT TTCCTCTCATCTCG 
G^TATGGAAAGATCTTGGAAATCATTATACNTCATACTTACAATANGAAAG 
AAGCTGAGCMTTTGAAAATCMCAATTTCTTTTGTACNTGTCAGAAAAGT 
GAAGATATATTAATCAGGGTTCTTCAGAGAAACATAACCAATAGGNCACA 
GNTCTATATGNCCNCNTTTATCTATCTATCTATCTATCTATCNCTATCTAT 
CNCANACCNGGNGAANTNATNTTTGNGAGATTNATGCAAGNCTGAGAAA 
NACCNAAGAANCTGCTCCCTGTNAAACTNGAGATNCAAGAANCTGAANA 
GTATAGNTCCAGTCCNAAGTCTANAGACCTTAGAATTAGGAAAACTGATA 
CTATAAATA CCAGTGTAAGTCTGAAAGCCTT AAANACCANATAGTGCCAT 
TGAAAGGGCAGAAGACTGATGTCCCAGTTCAAGCAGGCAAAGTTAGAGA 
AGCCTTATTTTCTGCAACATTGTTCTATTCAGACCCTTNANANGATTGACN 
ATGTCCACCCA 

Genbank ID' G08001 

Description: human STS CHLC.GATA52H04 P16177 done GATA52H04. 
Search for GDB entry 

20 WI-2620 

Database ID MRX3S (Also known as GQ3S02, DISSSSO. HHAal2h3, 2620} 
Source WICGR Random genome wide STSs 
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Chromosome: Chr13 
Primers: 

Left = TCTCCAAGCTATTGATTGGATAA 

Right = TTAAGAGCCAATTTATATAAAAGCAGC 

Product Length = 177 

Review complete sequence: 
CCCCTTTTGCCAACGCCATGCTTCACGTAGGGAGCCTGACATGCAGAAA 
A CTCTCCAAGCTATTGATTGGATAAA GAGCCAGAGCTGACTGAATTCCAT 
TCTTCTTGAGCCTCTCATTCTGTGTTTCTCGAATTTTTACCAAAGCATCTT 
GACACACAAATATCTGACTCAAGGAAMGGAAAAACAACTGCTTTTTCTC 
C AGCTGCTTTTATATAAATTGGCTCTTAAA CTTTCTAAGTTTATTATGGAT 
A 

Genbank ID: G03602 

Description: WICGR: Random genome wide STSs 
Search for GDB entry 



21, Wl-4211: 

Database ID: MR663S (Also known as G03617, D18S980, 4211) 
Source: WICGR: Random genome wide STSs 
Chromosome: Chr18 

Primers: 

Left = ATGCTTCAGGATGACGTAATACA 

Right = AAATTCTCGCTGATTGGAGG 

Product Length = 113 

Review complete sequence: 
CTAGTACCATAATCCCTTTTGGAATAAACCATCCCACCTTTAGTCAGANC 
AG ATGCTTCAGGATGACGTAATACA TAATAAGCCTACTCAGTTCTACTCT 
GGCTTTGTATGTCTTCAMGTGATATTTTTTTAAGTATOCTTGTCCjCJCjC 
AATCAGCGAGAATTT 

Genbank 10: G03617 

Description: WICGR: Random genome wide STSs 
Search for GDB entry 

22. D18S876: 

Database ID: GATA-D18S876 (Also known as G09963, CHLC.GATA61E10, 
D18S876) 

Source: CHLC' genetically mapped polymorphic tetranucleotide repeats 
Chromosome: Chria 

Primers - 

Left = TCAAACTTATAACTGCAGAGAACG 
Right = ATGGTAAACCCTCCCCATTA 
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Product Length = 171 

Review complete sequence: 
AAGACTGCAATTACATTTGC ATCAAACTTATAACTGCAGAGAACG TTGCC 
CACTATTTTATACCACACAACAGTATTCTTAGCCAGATTACATCTATCTAT 
CTATCTATCTATCTATCTATCXATCTATCTATCTATCTATCATCTATCTAGC 
TAGCTATCTATCTATAGAAC TAATGGGGAGGGTTTACCAT GTTTGGGTGA 
ACCCAAACATTTTATGGNCAAGGGNTTGGAAAATTACCCTTATCTACAAC 
TN7TNAACTTGTTTTGGTAGGNGTGNTAATTCCNTGGGNTTGGAANAACT 
TTTGNAATTTCCTCNTTGTTTNTNATTNNNNATTNNTNNNCATTATTNTGG 
GGTNTTCNGGGTGGAGGGCTNANTTTGGCCNCCCGGGTCCNNGGNGC 
NAGTNGGNNNGGNTNNTNGGGTTTNCTTGGGAANCNTNCCNCCTNCNG 
GGGNTTCANGGGN I i I I I NTTTNNTTG 

Genbank ID; G09963 

Description: human STS CHLC.GATA61E10.P17745 clone GATA61E10. 
Search for GOB entry 

23. GCT3G01 

Database ID: GCT-P10825 (Also known as G09484, CHLC.GCT3G01, 
CHLC.GCT3G01.P10825) 

Source: CHLC: genetically mapped polymorphic tetranudeotide repeats 
Chromosome: Chrl8 

Primers: 

Left - CTTTGCAATCTTAGTTAA7TGGC 
Right = GAACTATGATATGGAGTAACAGCG 
Product Length = 123 
Review complete sequence: 

agatgtttaa ctttgcaatcttagttaattggc agaaatgaaatttagttt 

ccacaacttttattcgatattaaaacaccaccaccatcagcagcagcagc 

agcagcagcagcat cgctgttactccatatcatagttc agagcatttaaa 

gn g gtc aaaatat ac aa ctag g ctg ac acc n g n ataag gtttaatt7taa 

accn gnggtctnccctctaaggnggntt i i l 1 1 1'tcttgncntggcttct 

ttttccntttgcttttgtaaaatatcaaggnatttttgggttnttcntggn 

anttnncnnantnntnnttnnncncnccccccntttgnggcgggggtc 

cccnnnttgccccggggttgngtgcagtaggggggtcncgggtnnng 

naagtttnggggccct 

Genbank ID: GQ9434 

Description- human STS CHLC GCT3G01.P10825 clone GCT3G01. 



24. WI-523- 

Database ID. MH232 (A:so known as G03539, 528, D18S828) 
Source: WICGR; Random genome wide STSs 
Chromosome ChrlS 
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Primers: 

Left = TTCTGCCTTTCCTGACTGTC 

Right - TGTTTCCCATGTCTTGATGA 

Product Length = 211 

Review complete sequence: 
CTACTAAG CAAATTCTG CTCAG CC TTCTG C CTTTCCTGACTGTCT TGTTG 
GCCCTTCCCACTTTAAGGATGCCTGTTTAAGTAGCCACCTCTAATTAGGA 
ATCTTCCCTTGTTCTTTCTCAGGAGGCTTAGACACTGTCAGTTTCCTGAA 
GACAGAAAATAAGCCTGCATTATCCTAGTAGTGGATTCAAAACTAATTGT 
GTCCTGAGTCTTTCA ATCATCAAGACATGGGAAACA CTCAACAG 

Genbank ID: G0358S 

Description: WICGR; Random genome wide STSs 
Search for GDB entry 

25. WM783: 

Database ID: MR432 {Also known as G03587, _shu_31.Seq, 1783, 
D18S824) 

Source: WICGR: Random genome wide STSs 
Chromosome: Chr18 

Primers: 

Left = CCAGTAATTAGACATTGACAGGTTC 

Right = TTTTACTAGACAGGCTTGATAAACAA 

Product Length = 305 

Review complete sequence: 
CCAGTAATTAGACATTGACAGGTTC CATACTAGTAATGTAGGGAATAGGG 
CTGCTGCTTTTTGGGTTTCCTTGAGTATACTTTGTGCTGCATAAATATGG 
CAATGGATAGTAAATAATTTGTATGCAGACCTTTAGTGTCGATTAACCTGT 
GA ATAAG GGAACAACAATCAAGGACAAAAATCAAAAGACTAATTCTCTAT 
ACATTTTGAGCTTTTGTAAAAAAGTMGATTAGCTGAATATATCTGAAAAA 
TTTCTAATCTCCTTTACAAI I 1 I I I AAA TTGTTTATCAAGCCTGTCTAGTAA 
AAATAATTCAGTTTCGGAATGTGG 

Genbank ID: G03587 

Description: WICGR: Random genome wide STSs 
Search for GDB entry 



26. D18S477- 

Database ID: AFM301XF5 (AJso known as 301xf5, Z24212, D18S477) 
Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome. ChrlS 

Primers' 

Left = GGACATCCTTGATTTGCTCATAA 
Right = GATTGACTGAAAACAGGCACAT 
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Product Length = 243 

Review complete sequence: 
GGACATCCTTGATTTGCTCATAA TACACTCATTCCTTTC ACCA TT GAGTG T 
GCACATATTTCTCTGATTGGAAAGAACTACAGAGGAGGTTTTACNTTTTA 
CTTTCCAGTTTGCTATTAAAGAGAGAAAACTAACAGAGNGAAATCAAGCA 
ACTCAAAACAACCTTACACACACACACACACACACACACTCACAAAGATA 
TTTTG7TCACCATATGTATT GATGTGCCTGTTTTCAGTCAATC CACAGGAA 
GGGCTAAGGAGAGTGACATCTGGGCTACATTAAAAGGACAGTCACATTG 
CTC AAAG N ACTCAAGTTTAGCC C GAGTACAGTAGCT 
Genbank ID: 224212 

Description: H. sapiens (D1 8S477) ONA segment containing (CA) repeat; 
clone 

Search for GDB entry 



27. D18S979: 

Database ID: GATA-P28080 (Also known as G08O15, CHLC. GATA92C08, 
CHLC. GATA92C08. P28080) 

Source: CHLC: genetically mapped polymorphic tetranucleotide repeals 
Chromosome: Chr18 

Primers: 

Left = AGCTTGCAGATAGCCTGCTA 

Right = TACGGTAGGTAGGTAGATAGATTCG 

Product Length = 155 

Review complete sequence: 
CTCTACAGTCTCTNACCTTTGGACTCCAGGACTTTCACCAGCACCCTCAA 
CATTCCCACTGGGTTCTCAGGACTTTATAGTTGTACTGAGCCATGCCACT 
GGATCCTAGGGTCTCC AGCTTGCAGATAGCCTGCTA TGGGAC7TAATCT 
TTGTAATAAGGTGAGTCAATTCTGCCAATAAACCTACTTTCATCTCTATCT 
ATCTATCTATCTATCTATCTATCTATCTATATCTATCATCTATCTATC^AAT 
CTATCTACCTACCTACCGTA TTAGTTCTGTCTCTCTGGAGN 

Genbank ID: G08O15 

Description: human STS CHIC.GATA92C08.P28080 clone GATA92C08. 
28. WI-9340' 

Database ID: UTR-05134 (Also known as G06102, D18S1034, 9340, 
X60221) 

Source: WICGR; Pnrr.srs derived from Genbank sequences 
Chromosome: Chrl8 

Primers 

Left = TG AG AG AAC G AAATC TC TATC G G 
Right = AGGCAGCAAGTTTTTATAAAGGC 
Product Length = 115, 
Review complete sequence: 
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ATGTATCTATCCCMTTGAGTCAGCTAGAAACAGTTGACTGACTAAATGG 

AAACTAGTCTATTTGACAAAGTCTTTCTGTGTTGGTGTCTACTGAAGTTAT 

AGTTTACCCTTCCTAAAAATGAAAAGTTTGTTTCATATAG TGAGAGAACGA 

AATCTCTATCGG CCAGTCAGATGTTTCTCATCCTTCTTGCTCTGCCTTTG 

AGTTGTTC C GTGATC ATTCTGAATAAG C ATT TGC CTTTATAAAAACTTGC T 

fiCCTGACTAAAGATTAACAGGTTATAGTTTAAATTTGTAATTAATT 

ATCTTGCAATAAAGTGACAATTGAATG 

GenbankiD: G06102 

Description: WICGR; Random genome wide STSs 
Search for GDB entry 



29, D18S466: 

Database ID: AFM094YE5 (Also known as 094ye5 ( Z23354, D18S466) 
Source; J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome: Chr18 

Primers; 

Left = ACACTGTAGCAGAGGCTTGACC 

Right = AG G C C AAGTTATGTG C C AC C 

Product Length - 214 

Review complete sequence: 
aaatgacactUaaqqaQQta acactqtaqcaqaqqcttaacc accacccagttctcactaqcactqagq 
atgctctattggttgggttacccacacacgcatagacatgcacacacacagacacacagacacacacac 
acacacacacacaccagatatagcattccaaaccatcaatatgctatgcaatactgcattaacaggtcatg 
cctgl qqtqqcacataacttoQcct aQaaaatactQQQQacQtctQcattcccttttattatcQaattQacttact 
tggcttctgagttttcctcagaagtaatacttcaatacctcttccatttctgccttgancattgtttggggtaccaag 
tatagct 

Genbank ID; Z23354 

Description: H. sapiens (D18S466) DNA segment containing (CA) repeat: 
clone 

Search for GDB entry 



30. D18S1092: 

Database ID: AFMA112WE9 (Also known as D18S1092, w5374, a112we9) 
Source: J Weissenbach, Genethon; genetically mapped polymorphic STSs 
Chromosome: Chrl5 

Primers: 

Left = CTCTCAAAGTAAGAGCGATGTTGTA 
Right = CCGAAGTAGAAAATCTTGGCA 
Product Length = 153 
Review complete sequence: 
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a qctctcaaaqtaaqaqcciatattqta actqactgaqttQttttgtgaanttttgnttttqgaqtcagtggagcat 

gttattagatgtaaatttaaacacacacacacacacacacacacacawcacgagaagtaagtarca^ 

attttctacttcqg cQcctatatttctatatactgattftctgtatttcccagacttgaatatagattfltctttctgntttat 

catagacaatctcataataanUaggcataataaggtaatgaggnttttctgggcttcttttcatcatccctgca 

atttgagtctcnhtalagntgaantcttctctgtaataacntettgttttagct 

Search for GOB entry 
3,1. D18S61: 

Database ID: AFM193YF8 (Also known as 193yf8, Z16834, D18S61) 
Source: J Weissenbach, Genethon: genetically mapped polymorphic STSs 
Chromosome: Chr18 

Primers: 

Left = ATTTCTAAGAGGACTCCCAAACT 
Right = ATATTTTGAAACTCAGGAGCAT 
Product Length = 174 
Review complete sequence: 
CGTCTTACCAAACCAACATAATATAGCAATGGN 

GGACTCCCAAACT ACATTCTTCTNCCTGAATTAAATACAGGCATTCAANA 
NAAACANACACACACACACACACACACACACACACACACACACACGCACA 
C C CTTC AAATC NTAG C AT AAATTC C N CTTATATAAA C ATAAC C ATG CTC CT 
GAGTTTCAAMTATT GGGTGGTTCGMGTTCGAAGCAACAAATTTCCAGT 
TAGTGTCTATTANTTGTTGGACAGCT 

GenbankID: 216834 

Description: H. sapiens (D18S61) DNA segment containing (CA) repeat; 
clone 

Search for GDB entry 
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Markers (STRs) used in refining the candidate region. 

Below the markers are shown that were used in family MAD31 to refine the 
candidate region. Most of these markers are already described above and 
wilt therefore only be mentioned to by their name. For the additional markers, 
the information is given here. 

Data was already shown for: D18S68, D18S55, D18S969, D18S1113, 
D18S483, D18S465, D18S876, D18S477, D18S979, D18S466 and D18S61. 

New data; 

1. D18S51: 

Other names: UT574, (D18S379) 
Primer sequences: 

UT574a GAGC C ATGTTCATGCCACTG 

UT574b CAAACCCGACTACCAGCAAC 

DNA-sequence: 

AATTGAGCNCAGGAGTTTAAGACCAGCCTGGGTAACACAGTGAGACCCC 

TGTCTCTAC AAAAAAATAC AAAAATN AGTTGG GC ATG GTG GC AC GT G C CT 

GTAGTCTCAGCTACTTGCAGGGCTGAGGCAGGAGGAGTTCTTGAGCCCA 

GAAGGTTAAGGCTGCAG TGAGCCATGTTCATGCCACTG CACTTCACTCT 

GAGTGACAAATTGAGACCTTGTCTCAGAAAGAAAGAAAGAAAGAAAGAAA 

GAAAGAAAGAAAGAANGAAAGAAAGAAAGTAAGAAAAAGAGAGGGAAAG 

AAAGAGAAANAGNAAANAAATAGTAGCAACTGTTATTGTAAGACATCTCC 

ACACACCAGAGAAGTTAATTTTAATTTTAACATGTTAAGAACAGAGAGAAG 

C CAACATGTCCACCTTAGGCTGAC GGTTTGTTTATTTGTGTT QTTGCTGG 

TAGTCGGGTTTG TTA1 I 1 1 I AAAGTAGCTTATCCAATACTTCATTAACAAT 

TTCAGTMGTTATTTCATCTTTCAACATAAATACGNACAAGGATTTCTTCT 

GGTCAAGACCAAACTAATATTAGTCCATAGTAGGAGCTAATACTATCACA 

TTTACTMGTATTCTATTTGCAATTTGACTGTAGCCCATAGCCTTTTGTCG 

GCTAAAGTGAGCTTAATGCTGATCGACTCTAGAG 

GEN8ANK ID: L18333 
2. D18S34S. 
Other name: UT575 
Primer Pairs- 
Primer A: TGGAGGTTGCAATGAGCTG 
Primer B: CATGCACACCTAATTGGCG 

DNA sequence, 

ACGAGGACAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAACCCCGTT 
TNTACTAAAANTACAAAANTTGGTCGGGAGGCTGGGGCAGGNGACATGC 
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TTGACCCCAGGAGG TGGAGGTTGCMTGAGCTGA GATTGCACCACTGCA 

CTNCAGCNTGG AAGAAAGAG AAAGGANAGNNAGGNAGNNAN NAAAC 

TACATNTGAAGTCAACACTAGTATTGGTGGGAGAGGAATTTTATGCTGCA 
TTCCCCNACAACCACTAGAT ACGCCAATTAGGTGTGCATG GTCCATGCTA 
T 

GenBank 10: L26588 
3, D18S817. 
Other name: UT6365 
Primer Pairs: 

Primer A- GCAAAGCAGAAGTGAGCATG 
Primer B: TAGGACTACAGGCGTGTGC 



DNA Sequence: 

CATATGGGTCCACAAGCAACCTCAGTCCTTGTCTCTTCAGAAGAAAGAAT 

TCTACTGAGGGNCATAAGGCAGAAGGAGAGACCTAGGCAAGTTGCAAAS 

CAGAAGTGAGCATG TATTAAAAAGCTTTAGAACAGTMGGAAAGGAAGAA 

AAGAAAAGAAGGAAAGTTCAACTTGGAAGAGGGCCAAGCCGGCAACTTG 

GCAGAAGGATTGCTTGAGCCCAGGAGTTAAGACCAGTCTGGGCAATATA 

GTGAGACTCCATCTCTGCATACATACATACATACATACATACATACATACA 

TACATACATATTGCAGGGTATGATG GCACACGCCTGTAGTCCTA GCTACT 

CTGGAGGTTGAGATGGGAGGGTCACTGAGCCTGGGAANTTGAGGCTGC 

NNTGAGCCATGATC 

GenBank ID: L30552 
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Characterisation of YACs. 

8 YACs were selected covering the candidate region and flanking the gap. 
These YACs were further characterised by determining the end-sequences 
by the Inverse-PCR protocol. 

Selected YACs: 961_h 9. 942 c 3, 766 f 12, 731 c 7, 907 ej, 752 _g 8, 
717 _<L3. 745_d_2 

New STSs based on end-sequences (unless indicated otherwise, the STSs 
were tested on a mcnochromosomal mapping pannel for identifying 
chimaerism of the YAC; if the STS revealed a hit not on chromosome 18q - 
chimaeric YAC- then it is indicated in the text below): 



1. SV32L 

Derived from YAC 745_d_2 left arm end-sequence. 

Primer A: G TTATTAC AATGTC AC C C TC ATT 
Primer B: ACATCTGTAAGAGCTTCACAAACA 

DNA-sequence: 

ATTC C TTN GTTATTAC AATGTC AC CCTCATTT AAAAAGTGGAAAGATAAAG 
AGGAAGCAATCTA1 II N M CC I I I N II CTGATAG CAC TTGTTTGTGAAG 

ctcttacagatgtt cttaagtaaaatcaactcctcca i urn ngtagca 
actacacatatttatcmtaatagttcacaaatacattttcaaatt 

Amplified sequence length: 107 basepairs (bp) 

This STS has no clear hit on the monochrcmosomal mapping pannel. 



2. SV32R. 

Derived from YAC 745j3_2 right arm end-sequence. 

Primer A: ACGTTTCTCAATTGTTTAGTC 
Primer B: TGTCTTGGCATTATTTTTAC 

DNA sequence: 

agacaatgggagaaattgcactgccctgagtcagaaatcagatctgttg 

ccatacagctgccgttatgtgatcatttgcaagtc aacgtttctcaattg 

IJtagjcatttgtaagacaaaaagactggttggatttcagagaatttgga 

atcctccttcaggtttaacaagcaataaatgatactcttcag tgtaaaaat 

mtgccaagaca tnatttgactttaaattaaatccaaacaagatatc 

Amplified sequence length: 127 bp 
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This STS has no clear hit on the monochromosomal mapping pannel. 
3. SV11L 

Derived from YAC 766_f_12 left arm end-sequence. 

Primer A: CTATGCTCTGATCTTTGTTACTTT 
Primer B: ATTAACGGGAAAGAATGGTAT 

DNA sequence: 

GTCTTTATTTCATATM CTATGCTCTGATCTTTGTTACTTT CTCCTTTTAAC 
TCAGTTTMGCTTTATTCTTATTTTCCAGCTGCTGAAGGTATATAGTTAGG 
TTGTTTATTGG ATACCATTCTTTCCCGTTAAT GTCAGTGGTTACTGCTATC 
AATGTAGCAGTTA 

Amplified sequence length: 1 1 8 bp 

This STS has a hit with chromosome 18 and must ba located between 
CHLC.GATA-p605l and D18S968. 



4. SV11R. 

Derived from YAC 766.M2 right arm end-sequence. 

Primer A: AAG GTATATTATTTGTGTC G 
Primer B: AAACTTTTCTTAACCTCATA 

DNA sequence: 

A TMGGTATATTAmGTGTCG TGAGTTMGAAATCATTAATAACTATTTT 

CAGMTGACAAATGTCATTATATGTTGTAAAAAAGATAAATACGTGAAATT 

ATGAGGTTAAGAAAAGTTTA 

Amplified sequence length: 1 1 9 bp. 

This STS has a hit with chromosome 18 and must be located between 
D18S876 and GCT3G01. 



5. SV34L. 

Derived from YAC 71?_d_3 left arm end-sequence. 

Primer A; TCTACACATATGGGAAAGCAGGAA 
Primer B: GCTGGTGGTTTTGGAGGTAGG 
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DNA sequence: 

acataaaatgtcgctcaaaaacaattatgtgt gtctacacatatgggaaa 
gcaggaaa caaatttgtttacaacatacattacttttgttttttag gc aag 
ataaaatn tcctacctccaaaaccaccagca cngtccgcaataactatac 

ATC 

Amplified sequence length: 98 bp 

This STS has a hit with chromosome 18. 



6. SV34R. 

Derived from YAC 7l7_d_3 right arm end-sequence. 

Primer A: ATAAGAGAC CAGAATGTGATA 
Primer B: TCTTTGGAGGAGGGTAGTC 

DNA-sequence: 

AATATCATTCTTCACCCACGTTATAC ATAAGAGACCAGAATGTGATA TTGT 

CATCTCACATGGAAAAATCTGCTGTGATCAGTTCCTGAAGCTTGCTGTGA 

TC CTC C CTTAG G AMGTAG AAAAATCTTTTTGAAAC ACTTTATTCTACAAT 

CAATGAAAATTAGGTGAAGCTACAGAAGCCAGAAATTACTCTAAGATTAG 

ACAATTATTTAAGANGACCAATTGTCTTTGGTCTTCTTCTGAAGGGTCTG 

ACTACCCTCCTCCAAAGA ATTCACTGGCCGTCGTTTTACAACGTCNTGA 

Amplified sequence length: 244 bp 

This STS has a hit with chromosome 1, therefore YAC 717_d_3 is chimaeric 



7. SV25L. 

Derived from YAC 731_c_7 left arm end-sequence. 

Primer A: AAATC TC TTAAG CTC ATG CTAGTG 
Primer B: CCTGCCTACCAGCCTGTC 

DNA sequence: 

AGTGGAGAGATAGAAAGAGAGGAAGA I I I I I ) I I I I I AAATCTCTTAAGCT 
C ATGC TAG TG TAGG TG CTGGC AGGTCTGAAC ACTCTGTAG GAC AGGCTG 
GTAGGCAGG AA 

Amplified sequence length: 72 bp 

This STS has no dear hits on the monochrornosomai mapping pannel. 
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8. SV25R. 

Derived from YAC 731 js_7 right arm end-sequence. 

Primer A: TGGGGTGCGCTGTGTTGT 
Primer B: GAGATTTCATGCATTCCTGTAAGA 

DNA-sequence: 

GGAGGGTGTTNTCACANAAGTC TGGGGTGCGCTGTGTTGT TCATTGTAA 
AAACCCTTTGGANCATCTGGGAATGTGCTGCCCCACATGTCCAGGTAAC 
GTTCTCAGGAAGGGGAGGCTGGAAATCTCTGTGTG TTCTTACAGGAATG 
CATOAAATCTCCCANCCCCTCTTGTTGGAAATTTCCCTCACTTT 

Amplified sequence length: 136 bp 

This STS has a hit with chromosome 7; therefore YAC 731_c_7 is chimaeric 



9. SV31L 

Derived from YAC 752_g_8 left arm end-sequence. 

Primer A: GAGGCACAGCTTACCAGTTCA 
Primer B: ATTCATTTTCTCATTTTATCC 

DNA-sequence: 

CTTCTCNATGANTGCACAAATGTCATTGGGTCAGCA TGAGGCACAGCTT 
ACCAGTTCA GATTCCAGTAGCTGAGGAACAAATCTTAACTCCAAAAATAA 
GTAATTGC GTC ACTTTG GAG GAATTATTTG AC CTTTTCATAACTTTG AC AT 
C AC AAC AATG AGG G TG AAGTTAGTAAMTAAATGATTATTATGAGGATAA 
AATGAGAAAATGAAT TNAGTGCTTAAGACAATGCTTGGTAACTAGTTAAN 
CCG 

Amplified sequence length: 1 78 bp 

This STS has a hit with chromosome 18 and must be located between 
D18S876 and GCT3G01. 



10. SV31R. 

Derived from YAC 752_g_8 right arm end-sequence. 

Primer A: CAAGATTATGCCTCAACT 
Primer B: TAAGCTCATAATCTCTGGA 
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DNA sequence: 

AAACTTTAACCAATTTAAACTCCCTAACAGTrCTATAAAATAAGCAAGATT 

ATGCCTCAACTT TATGGATAAAGAAATGGAGGCATTAAGAGATAACTAAC 

TTGCCCAAGGCCACACAAGTGACTGAGTAAGAATTGCAAAGCCAATGAG 

TCTGG CTCCAGAGATTATGAGCTTA ATCACCACACTGTGCCACCTCCTGT 

GTTTCCTGG 

Amplified sequence length: 131 bp 

This STS has no clear hits on the monochromosomat mapping pannel and 
gives no information concerning the chimaerity of the YAC. 



11. SV1QL 

Derived from YAC 942_c_3 left arm end-sequence. 

Primer A: TCACTTGGTTG GTTAAC ATTACT 
Primer B: TAGAAAAACAGTTGCATTTGATAT 

DNA-sequence: 

GGTNT TTCACTTGGTTGGTTAACATTACTT CTAAG I 1 I I I IATTGI I II I 1A 
TGCTATTGCTAATGGGATTGCTTTCTTAATTTA I I I 1 1 1 CCAATAGCTTGT 
TGTTAGTT TATATCAMTGCMCTGTTTTTCTA TGCAMTTATGTTTCCT 

Amplified sequence length: 130 bp 

This STS has a hit with chromosome 18 and must be located between 
CHLC.GATA-p6051 and 018S963 

12. SV10R. 

Derived from YAC 942_c_3 right arm end-sequence. 

Primer A: AACCCAAGGGAGCACAACTG 
Primer B: GGCAATAGGCTTTCCAACAT 

DNA sequence: 

TTGGTGGTGCCCTAGGTTTGGCAATTATAAATAAAGCTGCTACAAACATT 
CATGTGCAGGTCTCCGTGTGGACATMTTTTCCAGTTCATTTGGGTAAAA 
CCCAAGGGAGCACAACTG TTGGATCCTATNATAAAAATATNTCTCGTTTC 
ATTTAAAAAACCTGGGAAACTATCTNCCCACAGTGGCTGTCCCTTTTTGT 
ATCCCCACCAACAATGTTGGAAAGCCTATTGCCANCAT 



Amplified sequence length: 135 bp 
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This STS has a hit with chromosome 1 8 and must be located between 
018S876and GCT3G01 



13. SV6(,. 

Derived from YAC 961_h_9 left arm end-sequence. 

No primer was made, because this sequence is identical to a known STR 
marker D18S42, which is indeed mapped to this region. 

Primer A: 
Primer B: 

DNA sequence: 

CATGNCTCACAGTGTTCTGAGGCTGCTCTGGACATGCAATCTTGCATGC 
TTTTGTCATGACAGGTCTTAAANAGTTTATCAGCTTNCTCAAATAGCTGAA 
TGACANAACACTGGA I I I I I GTTCAAATANCCTATCAACTTGGCNTCTGT 
GTTGCGGTTGTCACTTG GTAAC AAAATAAGTC 

Amplified sequence length: 

SV6L recognises 018S42 which must be therefore located between WI-7336 
and WJ-8145 



14. SV6R. 

Oerived from YAC S61_h_9 right arm end-sequence. 

Primer A: TTGTGGAATGGCTAAGT 
Primer B: GAAAGTATCAAGGCAGTG 

DNA sequence: 

TMTTGACAMTAAAAATTGTATATTTTNCATATTTAACATGTTATGCTAAC 

ATATATATGGA TTGTGGAATGGCTAAGT CAGAAATTCTTTTACATTCATAT 

TTCC ATATTATTTACTTTNNGCTTTAAAAAATATGTAAATGANAATACTTAT 

I I II I I CAGTG TCACTGCCTTGATACTTTCA CATTTNNGTTACATATTATTT 

CCCTTNCATCTAACAAATATATATTGAGTTTCTATAATGTGTCTGACACTG 

A 

Amplified sequence length: 122 bp 

SV6R amplifies a segment on chromosome 18. This segment must be located 
between WI-2620 and WI-421 1 
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15. SV26L. 

Derived from YAC 907_e_J left arm end-sequence. 

Primer A: TATTTGGTTTGTTTGCTGAGGT 
Primer 8: CAAGAAGGATGGATACAAACAAG 

DNA sequence: 

TGGTCACTGGTGCCT TATTTGGTTTGTTTGCTGAGGT CATATTTCCTGTG 
GCCTTCATGCTTGATTTGTTGGAGTCTAGCCATGTAAAANTCTGTTGGAG 
TCTAGGCATTTAAAAAATAGGTATTTATTGTAATCTTTGCCATTTGCXLGT 
TTGTATCCATCCTTCTTG GGAAGGCTTTACAGGCATTCAAAAGG 



Amplified sequence length: 154 bp 

This ST$ has a hit with chromosome 13; therefore YAC 9Q7_e_1 is 
chimaeric. 



16. SV26R. 

Derived from YAC 907_e_1 right arm end-sequence. 

Primer A: CGCTATGCATGGATTTA 
Primer B: GCTGAATTTAGGATGTAA 

DNA sequence: 

C G CTATG C ATG G ATTTAA ACTGAGTGTAGTG C AC TCACTATGTTGC AGTC 
TCTTATTCTAGGTTCCTAATA TTTACATCCTAAATTCAGCT 

AmpHfied sequence length: 90 bp 

no clear hits on moncchromosomal mapping pannel: no information 
concerning chaemerity at this side of the YAC 
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Testing of 3 end-sequences flanking the gap in additional YACs; STS- 
markers Wi-4211, D18S87S and GCT3G01 are afso shown in order to 
identify YACs on opposite sides of the gap more clearly in table 3 below. 



10 



YACs 


STSs 


WM211 


D18S876 


SV31L 


SV11R 


SV10R 


GCT3G01 


940 b 1 


+ 




+ 








766 f 12 


+ 




+ 








846~a~5 


+ 




+ 


+ 






752~0-8 






+ 


+ 






745 d 2 




+ 










sere" 




+ 










942 c 3 






+ 


+ 


+ 




717 d 3 






+ 




-? 


+ 


972 e 11 














940jTlO 










+ 


+ 


821 e 7 










+ 


+ 


731_c_7 














889V4 










+ 


+ 


907 e 1 












+ 



* +: positive hit / -: no hit / ?: 2 instances were observed in which a positive 
hit was expected (on the assumed order of the markers) but not 
25 observed. The reasons for this are not dear. 

YA , 745.d.2 was excluded from further analysis since there was no clear 
Y t with chromosome 18. Of the remaining 7 from a monochromosomal 
mapping panel it was determined that 3 were chimeric and 4 non- 
30 chimeric as shown in Table 4 below. 
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TABLE 4 



YAC 


chimaeric 


chromosome 


s 961 _h_9 (6) 


no 




942lc_3 (10) 


no 




766 J J2 (11) 


no . 




731~c~7 (25) 


yes 


chromosome 7 


907 _eJ (26) 


yes 


chromosome 13 


lo 752_g_8 (31) 


no 




717 d_3 (34) 


yes 


chromosome 1 



For the non-chimeric YACs the STS based on the end- 
is sedquence flanking the gap (1 OR, 1 1 R, 3 1 L) was tested 
on 14 YACs flanking the gap. Overlaps between YACs 
on opposite sides of the gap were demonstrated: e.g. the 
"11 R" end-sequence (766 f.12) detects YAC 766.f.12 
and YAC 907.e.1. 
20 YACs were then selected comprising the minimum tiling 
path: 

TABLE 5 

25 



YAC 


size 


chimaerity 


961_h_9 


1180 kb 


not chimaeric 


766_M2 


1620 kb 


not chimaeric 


907 e 1 


1690 kb 


chimaeric (chr. 13) 



30 

These three YACs are stable as determined by PFGE 
and their sizes roughly correspond to the published 
sizes. These YACs were transferred to other host- 
35 yeast strains for restriction mapping. 
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Experimental 2 

Construction of fragmentation vector: 

5 A 4,5kb ECORI/Sall fragment of pBLCS.l (Lewis et 

al, 1992) carrying a lysine-2 and a telomere sequence 
was directionally cloned into GEM3zf (-) digested with 
ECORI/Sall. Subsequently, an End Rescue Site was 
ligated into the EcoRI site. Hereto, two 

10 oligonucleotides ( strand l: S'-TTCGGATCCGGTACCATCGAT- 
3' AND STRAND 2: S'-GCCTAGGCCATGGTAGCTATT-S') were 
ligated into a partial ( dATP } filled ECORI site, 
generating the vector pDFl. Triplet repeat containing 
fragmentation vectors were constructed by cloning of a 

15 21bp and a 3 0bp CAG/CTG adapter into the Klenow-f illed 
PstI site of pDFl- Trasformation and selection 
resulted in a (CAG) 7 and a (CTG) 10 fragmentation vector 
with the orientation of the repeat sequence 5' to 3' 
relative to the telomere* 

20 

Yeast transformation : 

Linearised (digested with Sail) vector was used 
to transform YAC clones 961X9, 766_f,12 or 907_e.l 

25 using the LiAc method. After transformation the YAC 
clones were plated onto SDLys* plates to select for 
the presence of the fragmentatio vector. After 2-3 
days colonies were replica plated onto SDLys"-Trp"-Ura" 
and SDLys'-Trp"-Ura r plates. Colonies growing on the 

30 SDLys"-Trp"-Ura + plates but not on the SDLys"-Trp"-Ura" 
plates contained the fragmented YACs. 

Analysis of fragmented YACs : 
35 Yeast DNA isolated from clones with the correct 
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phenotype was analysed by Pulsed Field Electrophoresis 
(PFGE) , followed by blotting and hybridisation with 
the Lys-2 gene and the sizes of the fragmented YACs 
were estimated by comparison with DNA standards of 
5 Known length. 

End Rescue : 

Fragmented YACs characterised by a size, common to 
10 other fragmented YACs, indicative of the presence of a 
major CAG or CTG triplet repeat, were digested with 
one of the enzymes from the End Rescue site, ligated 
and used to transform £. Coli, After growth of the 
transformed bacteria the plasmid DNA was isolated and 
15 the ends of the fragmented VACs, corresponding to one 
of the sequences flanking the isolated trinucleotide 
repeats f were sequenced. 

Sequencing revealed that fragmented YACs of an 
equal length were all fragmented at the same site. A 
20 BLAST Search of the GenBank database was performed 
with the identified sequences to identify homology 
with known sequences. The complete sequence spanning 
the CAG or CTG repeats of the fragmented YACS was 
obtained by Cosmid Sequencing, employing sequence 
25 specific primers and splice primers, as previously 
described (Fuentes et aJ . 1992 Hum. Genet. 101: 346- 
350} or by using the "genome walker" kit (clontech 
Laboratories, Palo Alto, USA) and described in Siebert 
et al. Nucleic Acid Res (1995) 23(6): 1087-1088 and 
30 Siebert et al . (1995) CLONTECHniques X(II) : 1-3, 

Results: 

A YAC 961. h.9 clone was transformed with the 
35 (CAG) 7 or (CTG) 10 fragmentation vector. The CTG vector 



JUN 14 2000 10U9 



020 7831 1768 PAGE. 65 



14-j;UN-2000 15:04 FROn BOULT WADE TENNANT 



TO 0016177202441 



P. 66 



- 66 - 



did not reveal the presence of any ctg repeat. 
Analysis of twelve (CAG) 7 fragmented YACs showed that 
five of these had the same size of approximately 
lOOkb, End Rescue was performed with ECORI and 
5 sequencing of three of these fragments revealed that 

they all shared the terminal sequence shown in italics 
in Figure 15a* A BLAST * search of the Genbank database 
with this sequence indicated the presence of a 
sequence homology with the CAPS gene (GenbBank 
10 accession number: L40377) . The sequence spanning the 
CAG repeat shown in Figure 15a was obtained by both 
cosmid sequencing and genome walker sequencing - The 
sequence was mapped between markers D18S63 and Wl-3170 
by STS content mapping. 

15 

A YAC 766-f-12 was fragmented using the 
(CAG) 7 or (CTG) 10 fragmentation vector. Again the 
(CTG) 10 vector did not reveal the presence of any CTG 
repeat- Analysis of twenty (CAG) 7 fragmented YACs 

20 showed the presence of two groups of fragments with 
the same size: five of approximative ly 650kb and two 
of approximatively 50kb. 

End Rescue was performed using ECORI on four of 
the fragmented YACs of 650kb. Sequencing confirmed 

25 that they all shared identical 3' terminals, 

characterised by the sequence shown in italics in 
Figure 16a. A Blast Search showed homology of this 
sequence with the Alu repeat sequence family. The 
sequence spanning the CAG repeat shown in Figure 16a 

30 was obtained by cosmid sequencing. The sequence was 
mapped between markers WI-2620 and WI-4211 by STS 
content mapping on the YAC contig map. 
End Rescue was also performed on the two fragments of 
50kb. Sequencing revealed the sequence shown in 

35 italics in figure 17 a. A Blast Search revealed no 
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sequence homology with any known sequence. Cosmid 
sequencing allowed to identify the complete sequence 
spanning the CAG repeats, shown in figure 17a, The 
sequence was mapped between markers D18S968 and 
5 D18S875 by STS content mapping on the YAC contig map, 

A YAC 907-e-l clone was transformed with the 
(CAG) 7 or {CTG} 10 fragmentation vector- The (CAG} 7 
vector did not reveal the presence of any CAG repeat. 

10 Analysis of twenty-six (CTG) 10 fragmented YACs revealed 
that twenty-one of them had the same size of 
approximatively 900kb. End Rescue was performed with 
Kpnl on three fragmented YACS of this si2e. Sequencing 
revealed the nucleotide sequence shown in italics in 

15 Figure 18a. A Blast Search indicated the presence of 
an homology of this sequence with the GCT3G0I marker 
(GenBank accession number: G094S4) . The sequence 
spanning the CTG repeat was obtained from the GenBank 
Database* The sequence was mapped between markers 10R 

20 and WI-528. 



25 



30 



35 
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CLAIMS i 

1. Use of an 8.9 cM region of human chromosome 
ISg disposed between polymorphic markers D18S63 and 
5 D18S979 or a fragment thereof for identifying at least 
one human gene, including mutated or polymorphic 
variants thereof, which is associated with a mood 
disorder or related disorder, 

10 2. Use of a YAC clone comprising a portion of 

human chromosome 18q disposed between polymorphic 
markers D18S60 and D18S61 for identifying at least one 
human gene, including mutated or polymorphic variants 
thereof, which is associated with a mood disorder or 

15 related disorder. 

3- The use as claimed in claim 2 wherein said 
portion comprises the region of chromosome I8g between 
polymorphic markers D1BS6S and D18S979 or a fragment 
20 of said region, 

4. The use as claimed in claim 2 or 3 wherein 
said YAC clone is 961. h.9, 942. c„3, 766. t m 12, 731. c. 7, 
907 v e.l, 752-g-8 or 717. d^3. 

25 

5. The use as claimed in claim 4 wherein said 
YAC clone is 96l.h_9, 766. f. 12 or 907. e„l. 

6. The use as claimed in any preceding claim 
30 wherein said mood disorder or related disorder is 

selected from the Diagnostic and Statistical Manual of 
Mental Disorders, version 4 (dsm-IV) taxonomy and 
includes mood disorders (296. XX, 300.4, 311, 301, 13, 
295.70), schizophrenia and related disorders (295, 
35 297.1, 293.9, 297.3, 298.9), anxiety disorders 
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(300-XX, 309.81, 308,3} , adjustment disorders (309, 
XX) and personality disorders (codes 301- XX) • 

7 , A method of identifying at least one human 
5 gene, including mutated or polymorphic variants 

thereof , which is associated with a mood disorder or 
related disorder which comprises detecting nucleotide 
triplet repeats in a region of human chromosome I8g 
disposed between polymorphic markers D18S68 and 
10 D18S979. 

8, A method of identifying at least one human 
gene, including mutated or polymorphic variants 
thereof / which is associated with a mood disorder or 

15 related disorder which comprises fragmentation of a 
YAC clone as defined in any one of claims 2 to 4 and 
detection of nucleotide triplet repeats. 

9, A method as claimed in claim 7 or 8 wherein 
20 said repeated triplet is CAG or CTG. 

10* A method as claimed in claim 9 wherein said 
repeated triplet is detected by means of a probe 
comprising at least 5 CTG and/or CAG repeats, 

25 

11. A method of identifying at least one human 
gene including mutated or polymorphic variants 
thereof, which is associated with a mood disorder or 
related disorder wherein said gene is present in the 

30 DNA comprised in the YAC clones as defined in any one 
of claims 2 to 5, which method comprises the step of 
detecting an expression product of said gene with an 
antibody capable of recognising a protein with an 
amino acid sequence comprising a string of at least 8 

35 continuous glutamine residues. 
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12. A method as claimed in claim 11 wherein said 
DNA forms part of a human cDNA expression library. 

13. A method as claimed in claim 11 or claim 12 
5 wherein said antibody is mAB 1C2. 

14. A method of preparing a contig map of YAC 
clones of the region of human chromosome I8q between 
polymorphic markers D18S60 and D18S61 which comprises 

10 the steps of: 

(a) subcloning the YAC clones according to 
any one of claims 2 to 5 into exon trap vectors; 

15 (b) using the nucleotide sequences shown in 

any one of Figures l to 11 or any other known sequence 
tagged sequence from the YAC contig described herein, 
or part thereof consisting of not less than 14 
contiguous bases or the complement thereof, to detect 

20 overlaps among the cosmid vectors, and 

(c) constructing a cosmid contig map of a 
YAC clone of said region. 

25 15, A method of identifying at least one 

human gene or any mutated or polymorphic variant 
thereof which is associated with a mood disorder or 
related disorder which comprises the steps of: 

30 (a) transfecting mammalian cells with DNA 

sequences cloned into an exon trap vector as prepared 
in claim 14; 

(b) culturing said mammalian cells in an 
35 appropriate medium; 
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(c) isolating RNA transcripts expressed from 
an SV40 promoter; 

(d) preparing cDNA from said RNA 
5 transcripts; 

(e) identifying splicing events involving 
exons of the DNA subcloned into said exon trap vector 
in accordance with claim 14 to elucidate positions of 

10 coding regions in said subcloned DNA; 

(f) detecting differences between said 
coding regions and equivalent regions in the DMA of an 
individual afflicted with said mood disorder or 

15 related disorder; and 

(g) identifying said gene or mutated or 
polymorphic variants thereof which is associated with 
said mood disorder or related disorder. 



20 



25 



16. A method of identifying at least one human 
gene or mutated or polymorphic variants thereof which 
is associated with a mood disorder or related disorder 
which comprises the steps of; 

(a) subcloning the YAC clones according to 
any one of claims 2 to 5 into a cosmid, BAC, PAC or 
other vector; 



30 (b) using the nucleotide sequences shown in 

any one of Figures 1 to 11 or any other known sequence 
tagged sequence from the YAC contig described herein, 
or part thereof consisting of not less than 14 
contiguous bases or the complement thereof, to defect 

35 overlaps amongst the subclones and construct a map 
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thereof; 

(c) identifying the position of genes within 
the subcloned DNA by one or more of CpG island 

5 identification, zoo-blotting, hybridization of said 

subcloned DNA to a cDNA library or a Northern blot of 
tnHNA from a panel of culture cell lines; 

(d) detecting differences between said genes 
10 and equivalent regions of the DNA of an individual 

afflicted with a mood disorder or related disorder; 
and 

(e) identifying said gene which, if 

15 defective, is associated with said mood disorder or 
related disorder, 

17. An isolated human gene, including mutated or 
polymorphic variants thereof f which is associated with 
20 a wood disorder or related disorder which is 

obtainable by the method according to any of claims 7 
to 13, 15 or 16, 

IS. A human protein which, if defective, is 
25 associated with a mood disorder or related disorder 

which is the expression product of the gene according 
to claim 17. 

19. A cDNA encoding the protein of claim 18 which 
30 is obtainable by the method of any one of claims 7 to 

13 , 15 or 16. 

20. Use of a probe of at least 14 contiguous 
nucleotides of the cDNA of claim 19 or the complement 

35 thereof in a method for detection in a patient of a 
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pathological mutation or genetic variation associated 
with a mood disorder or related disorder which method 
comprises hybridi2ing said probe with a sample from 
said patient and from a control individual. 

5 

21. A nucleic acid molecule which comprises a 
sequence of nucleotides as shown in any one of Figures 
15a, 16a, 17a or 18a. 

10 22. A nucleic acid molecule which comprises a 

sequence of nucleotides which differ from a sequence 
of nucleotides as shown in any one of Figures 15a , 
16a, 17a or 18a only in the extent of trinucleotide 
repeats. 

15 

23. A protein encoded by a nucleic acid molecule 
as claimed in claim 21. 

24. A protein encoded by a nucleic acid molecule 
20 as claimed in claim 22* 

25. A method of determining the susceptibility 
of an individual to a mood disorder or related 
disorder which method comprises analysing a sample of 

25 DNA from that individual for the presence of a DNA 
polymorphism associated with a mood disorder or 
related disorder in a region of chromosome I8q 
disposed between polymorphic markers D18S6S and 
D18S979. 

30 

26. A method as in claims 25 wherein said DNA 
polymorphism is a trinucleotide repeat expansion. 

27. A method as in claim 26 wherein said 

35 trinucleotide repeat expansion is comprised in a 
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sequence of nucleotides that differ from the sequence 
of nucleotides shown in any one of Figures 15a, 16a, 
17a or 18a only in said trinucleotide repeat 
expansion. 

5 

28. A method as in claim 26 or 27 which comprises 
the steps of: 

a) obtaining a DNA sample from said 
10 individual; 

b) providing primers suitable for the 
amplification of a nucleotide sequence comprised in 
the sequence shown in any one of Figures 15a, 16a, 17a 

15 or 18a said primers flanking the trinucleotide repeats 
comprised in said sequence; 

c) applying said primers to the said DNA 
sample and carrying out an amplification reaction; 

20 

d) carrying out the same amplification 
reaction on a DNA sample from a control individual; 
and 

25 e) comparing the results of the 

amplification reaction for the said individual and for 
the said control individual; 

wherein the presence of an amplified 
30 fragment from said individual which is bigger in size 
from that of said control individual is an indication 
of the presence of a susceptibility to a mood disorder 
or related disorder of said individual. 

35 29. A method as in claim 28 wherein said 
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nucleotide sequence to be amplified is comprised in 
the sequence shown in Figure 15a and said primers have 
the sequences shown in Figure 15b. 

5 30. A method as in claim 28 wherein said 

nucleotide sequence to.be amplified is comprised in 
the sequence shown in Figure 16a and said primers have 
the sequences shown in Figure 16b, 

10 31. A method as in claim 28 wherein said 

nucleotide sequence to be amplified is comprised in 
the sequence shown in Figure 17a and said primers have 
the sequences shown in Figure 17b. 

15 32 * A method as in claim 23 wherein said 

nucleotide sequence to be amplified is comprised in 
the sequence shown in Figure iSa and said primers have 
the sequences shown in Figure 18b. 

20 33 * A method of determining the susceptibility 

of an individual to a mood disorder or related 
disorder which method comprises the steps of : 

a) obtaining a protein sample from said 
25 individual; and 

b) detecting the presence of the protein of 

claim 24; 

30 wherein the presence of said protein is an 

indication of the presence of a susceptibility to a 
mood disorder or related disorder of said individual. 

34. A method as in claim 33 wherein said protein 
35 is detected with an antibody which is capable of 
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10 



recognising a string of at least 8 continuous 
glutamines. 

35* A method as in claim 3 4 wherein said 
antibody is mAB 1C2. 

36, A nucleic acid as claimed in claim 21 for use 
as a medicament in the treatment of a mood disorder or 
related disorder. 

37. A protein as claimed in claim 23 for use as a 
medicament in the treatment of a mood disorder or 
related disorder - 



15 38, A pharmaceutical composition which comprises 

a nucleic acid as claimed in claim 21 and a 
pharmaceutical^ acceptable carrier. 

39, A pharmaceutical composition which comprises 
20 a protein as claimed in claim 23 and a 

pharmaceutical^ acceptable carrier, 

40, An expression vector which comprises a 
sequence of nucleotides as claimed in claims 21 or 22* 

25 

41. A reporter plasmid which comprises the 
promoter region of a nucleic acid molecule as claimed 
in claim 21 or 22 positioned upstream of a reporter 
gene which encodes a reporter molecule so that 

30 expression of said reporter gene is controlled by said 
promoter region. 

42. A cell line transfected with the expression 
vector of claim 40, 

35 
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43. An eukaryotic cell or multicellular tissue or 
organism comprising a transgene encoding a protein as 
claimed in claims 23 or 24. 

5 44. A method for determining if a compound is an 

enhancer or inhibitor of expression of a gene 
associated with a mood disorder or related disorder 
which comprises the steps of; 

10 a) contacting a cell as claimed in 

claim 42 with said compound; 

b) detecting and/or quantitatively 
evaluating the presence of any mRNA transcript 

15 corresponding to a nucleic acid as claimed in claim 21 
or 22; and 

c) comparing the level of transcription 
of said nucleic acid with the level of transcription 

20 of the same nucleic acid in a ceil as claimed in claim 
42 not exposed to said compound; 

45. A method for determining if a compound is an 
enhancer or inhibitor of expression of a gene 
25 associated with a mood disorder or related disorder 
which comprises the steps of: 

a) contacting a cell as claimed in claim 42 
with said compound; 

30 

b) detecting and/or quantitatively 
evaluating the expression of a protein as claimed in 
claims 23 or 24 and 

35 c) comparing the level of expression of said 
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protein with that of the same protein in a cell not 
exposed to said compound, 

46. A method for determining if a compound is an 
5 enhancer or inhibitor of expression of a gene 

associated with a mood disorder or related disorder 
which comprises the steps of: 

a) contacting a cell transfected with a 
10 reporter plasmid as claimed in claim 41 with said 

compound ; 

b) detecting or quantitatively evaluating 
the amount of reporter molecule expressed; and 

15 

c) comparing said amount with the amount of 
expression of said reporter molecule in a cell 
comprising said reporter plasmid and not exposed to 
said compound* 

20 

47. A compound identified as an enhancer or an 
inhibitor of the expression of a gene associated with 
a mood disorder or related disorder by a method as 
claimed in claims 44 to 46. 

25 



30 
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SEQUENCE LISTING 



<110> VLAAMS XNTERUN I VERSI TAI R 2NSTITUUT VOOR BIOTECHNOLOGIE 

<120> MOOD DISORDER GENE 

<130> 48464/000 

<140> 
<141> 

<160> 23 

<170> PatentXn Ver. 2-0 

<210> 1 

<211> 167 

<212> DNA 

<223> Homo sapiens 

<400> 1 

gtctttattt catataacta tgctctgatc tttgttactc tctcctttta actcagtcta 60 
agctttattc ttattttcca gctgctgaag gtatatagtt aggttgttta ttggatacca 120 
ttctttcccg ttaatgtcag tggttactgc tatcaatgta gcagtta 167 

<210> 2 

<211> 122 

<212> DNA 

<213> Homo sapiens 

<400> 2 

ataaggtata ttatttgtgt cgtgagttaa gaaatcatta ataactattt tcagaatgac 60 

aaatgtcatt atatgttgta aaaaagataa atacgtgaaa ttatgaggct aagaaaagtt 120 
ta 122 

<210> 3 

<211> 154 

<212> DNA 

<213> Homo sapiens 

<40Q> 3 

acataaaatg tcgctcaaaa acaattatgt gtgtctacac atatgggaaa gcaggaaaca 60 
aatttgttta caacatacat tacttttgtt ttttaggcaa gataaaatnt cctacctcca 120 
aaaccaccag cacngtccgc aaraactata catc 154 

<210> 4 

<2Z1> 301 

<212> DNA 

<213> Homo sapiens 



aatatcattc ttcacccacg ttatacataa gagaccagaa tgtgatattg tcatctcaca 60 

tggaaaaatc tgctgtgatc agttcctgaa gcttgctgtg atcctccctt aggaaagtag 120 

aaaaatcttt ttgaaacact ttattctaca atcaatgaaa attaggtgaa gccacagaag 180 

ccagaaatta ctctaagatt agacaattat ttaagangac caattgtctt tggtcttctt 240 

ctgaagggtc tgactaccct cctccaaaga attcactggc cgtcgtttta caacgtcntg 300 



<210> 5 

<211> 191 

<2I2> DNA 

<213> Homo sapiens 

<400> 5 



<400> 4 



a 



301 
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ggagggtgtt ntcacanaag tctggggtgc 
ancatctggg aatgtgctgc cccacatgtc 
aaatctctgt gtgttcttac aggaatgcat 
tccctcactt t 



gctgtgttgt tcattgtaaa aaccctttgg 60 
caggtaacgt tctcaggaag gggaggctgg 120 
gaaatctccc ancccctctt gttggaaatt 180 

191 



<220> 6 

<211> 253 

<212> DNA 

<213> Homo sapiens 



<4C0> 6 

cttctcnatg antggacaaa tgtcattggg tcagcatgag gcacagctta ccagttcaga 6 0 
ttccagtagc tgaggaacaa atcttaactc caaaaataag t&attgcgcc acttt-ggagg 120 
aattatttga ccttttcata actttgacat cacaacaatg agggtgaagt tagtaaaata 180 
aatgattatt atgaggataa aatgagaaaa tgaattnagt gcttaagaca atgcttggta 24 0 
actagttaan ccg 253 

<210> 7 

<211> 153 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggtntttcac ttggttggtt aacattactt ctaagttttt tattgttttt tatgctattg 60 

ctaatgggat tgctttctta atttattttt tccaatagct tgttgttagt ttatatcaaa 120 
tgcaactgtt tttctatgca aattatgttt cct 153 



<210> 8 

<211> 238 

<212> DNA 

<213> Homo sapiens 



<400> 8 

ttggtggtgc cctaggtttg gcaattataa 
tctccgtgtg gacataatrt tccagttcat 
ggatcctatn ataaaaatat, ntctcgtttc 
cagtggctgt ccctttttgt atccccacca 



ataaagctgc tacaaacatt catgtgcagg 60 
ttgggtaaaa cccaagggag cacaactgtt 120 
atttaaaaaa cctgggaaac tatctnccca 180 
acaatgttgg aaagcctatt gccancar 238 



<210> 9 

<211> 182 

<212> DNA 

<213> Homo sapiens 



<400> 9 

catgnctcac agtgttctga ggctgctctg 
caggtcttaa anagtttatc agcttnccca 
gttcaaatan cctatcaact tggcncctgt 
tc 



gacatgcaat cttgcatgct tctgtcatga 60 
aatagctgaa tgacanaaca ctggattttr. 120 
gttgcggttg tcacttggta acaaaacaag 180 

132 



<210> 10 

<211> 259 

<212> DNA 

<213> Homo sapiens 



<400> 10 

taattgacaa ataaaaattg tatactttnc 
ggattgtgga atggctaagt eagaaattct 
tnngctttaa aaaatatgta aatganaata 
ctttcacatt tnngttacat attatttccc 
ataatgtgtc tgacactga 

<210> 11 
<211> 195 
<212> DNA 



atatttaaca tgttatgcta acatatatat 60 
tttacattca tatttccata ntatttactt 120 
cttatttttt tcagtgtcac tgccttgata 180 
ttncatctaa caaatatata ttgagtttct 240 

259 
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<213> Homo sapiens 
<400> 11 

tggtcactgg tgccttattt ggtttgcttg ctgaggtcat atttcctgtg gccttcatgc 60 
ttgatttgtt ggagtctagc catgtaaaan tctgttggag tctaggcatt taaaaaatag 120 
gtatttattg taatctttgc catttgcttg tttgtatcca tccttcttgg gaaggcttta ISO 
caggcattca aaagg 195 

<210> 12 

<211> 656 

<212> DNA 

<213> Homo sapiens 



gccaacaaac aaaatgaaat aagacctggg atgtattttt tggccaaggc aattagaaaa 60 
tgattagtat ccttatcagg agcaatttca gagaatgttt gggtggacgt ctaactacag 120 
tggagtcaaa cgtgaatcaa cggtgaaaaa aggacaatag ccaatgtgta cactttttat 180 
aaaaaccacc ctccaaggac caggcactgg ccctctctcc ggtgcccaca gacatccaca 240 
caggcccaaa gaatcaggga ttgcacaagc cagagcaatc gaacggttct gagtcatctg 300 
ccggaagcct tgccctcaat caaggcggac gtgaagcatc tacaaaggag gaatagrcaa 360 
agcagcagcg gcggcggcgg cggcggcagc agcagcagca gcaggaggtg ggggcctctg 420 
ccaggtaccg ggcggggcag gcacggaggt gcccaggttc ccgcggaggc cacctcttcc 4 80 
ctggagtgcg tgagagaggg gaagggagga aggccagagc aggaatcaga gcgaggcaaa S4 0 
9Sfcgggcagg aactaxgaga atgacsgcgg gaggcggccg ggaaagaaax tctcggggct 600 
gtgggggtcx ccctggcacc agccggggtc ccaagcccca ccgcgagacc ccgcga 656 

<210> 13 

<211> 22 

<212> DMA 

<213> Homo sapiens 



<210> 14 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cgctctgatt cctgctctg 19 

<210> IS 

<211> 546 

<212> DNA 

<213> Homo sapiens 



ttcagtagaa ggaagcacag caaatttgcc tttatagaga ttcaattctt ggtgcttggg 60 
ccaaagaata agaattacat taagcaggcc gggcacggtg gctcacacct gtaaaaccag 120 
aactttggga ggccgaggca ggcagatcat gaggtcagga gatcgagacc atcctggaca 180 
acatagtgaa accccatctc tactaaaaat acaaaaatta gccgggcatg gtggtgcatg 240 
cctgtaatcc cagctactca ggaggcggag gcaggagaat cccttgaacc agggagttgg 300 
aggttgcagt gagccgagar cacgccacag cactctagcc tggcgacaga gtgagactcc 360 
atctcaaaaa aaaaaaaaaa aaaaaaaaaa ttacattaag cagcagcagc agcagtgasa 420 
gagggaakaa tgaaagaaga aatttctaga ataagattga tctccagcac catgccaatc 4 30 
atggactgga tacaattcat gcatatcttt tgtgagagag gtgagagatg tgaatcctct 540 
ctcatt 546 

<210> 16 

<211> 22 

<212> DNA 

<213> Homo sapiens 



<400> 12 



<400> 13 

atcgaacggt tctgagtcat ct 



22 



<400> 15 
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i u 



<4D0> 16 

agaaggaagc acagcaaatt tg 



22 



<210> X7 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 17 

gcatggtgct ggagatcaat 20 

<210> 18 

<211> S73 

<212> DNA 

<213> Home sapiens 



tgggagttaa agcagacatt eggctttngt gttgecagag ttctaacata agttcttttt 60 
catctgggca ggcngatgtt ccttccatct tngaagnacn gtccttttca ttttttttat 120 
ttngcttttg gsktttatct tettagaegt cttcaggagt tkgattgtag kgtaaggcag 130 
atttagttga ctgggctttg tttctggaaa attttaaagg ggcaagtcct gggctgeata 240 
ttcttactcc gggggcttag tactggcccc taaatttgtt ctctggctcc tcaaggttag 300 
aaatctgctg gctggagggg ctgagatgtt ccttgactgc tggecagaac attccgccgg 360 
ggggtggcaa ccgaagtgtt tctttgggca atggcagcag aattcatgat tgttttcatg 420 
trccagcagc agtggcagcg caktgagttg catgattgtt ggctggggct gagtgctggc 480 
asgcactgga gtgtttggct tccagtagaa attcacagca gtagtagfcgg tggcatggga 540 
aggagggcag yggtggcatg gggaggaccc ccc 573 

<210> 19 

<211> 22 

<212> DNA 

<213> Homo sapiens 



<210> 20 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 20 

ccttcccatg ccaccactac ta 22 

<210> 21 

<211> 597 

<212> DNA 

<213> Homo sapiens 



tgtaattccc agcaatttgg ggagcccaag gegggcagat tcatgagttc gggaagattc 60 
gagaccnttc ctggctaaac aegggggaaa ccccntcttt actaaaaaat accaaaaaat 120 
taacctgggc gtggtggcgg gccccagcta ntceggagge tgaggcagga gaatggtgtg 180 
aaccegggag gcggagcttg cagtgagccg agatcccgct actgcactcc agectgggea 240 
atagagggag actccgtctc aaaaaaaaaa aaaaataaat aataataaaa aaaataacaa 300 
taataatact aataattget tgatatttta caaaagcaaa aggaaaagaa gactaggcaa 360 
gaaaaaaaaa acctccttag atggtagaac tcaggtttaa aattaaaact tattctggcg 420 
tcagsctagt tgtatatttt gacctcttta aatgctctga actatgatac ggagtaacag 480 
egatgetget getgetgetg ctgctgctga tggtggtggt gttttaatat cgaataaaag 54 0 
ttgtggaaac taaatttcat ttctgccaat taactaagat tgcaaagtta aacatct 597 

<210> 22 
<211> 22 
<212> DNA 



«J00> 18 



<400> 19 

ggctgagatg ttccttgact gc 



22 



<400> 21 
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<213> Homo sapiens 
<400> 22 

tttgcaatct tagttaattg gc 22 

<210> 23 

<2I1> 24 

<212> DNA 

<213> Homo sapiens 



<400> 23 

gaactatgat atggagtaac agcg 



24 
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ABSTRACT 

The present invention comprises the use of an 8,9 cM 
region of human chromosome ISq disposed between 
5 polymorphic markers D18S68 and D18S97 9 or a fragment 
thereof for identifying at least one human gene, 
including mutated and -polymorphic variants thereof, 
which is associated with mood disorders or related 
disorders. The invention also provides methods for 

10 determining the susceptibility of an individual to 
mood disorders or related disorders, comprising 
analysing a DNA sample for the presence of a 
trinucleotide repeat expansion in the above region. 
Polynucleotide sequences useful for detecting the 

15 presence of such trinucleotide repeat expansions are 
also provided. 
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GTCTTTATTTCATATA ACTATGCTCTGATCTTTGTTACTTT CTC CTTTTAAC 
TCAGTTTAAGCTTTATTCTTATTTTCCAGCTGCTGAAGGTATATAGTTAGG 
TTGTTTATTGG ATACCATTCTTTCCCGTTAAT GTCAGTGGTTACTGCTATC 
AATGTAGCAGTTA 



AT AAGGTATATTATTTGTGTCG TGAGTTAAGAAATCATTAATAACTATTTT 

CAGMTGACAAATGTCATTATATGTTGTAAAAAAGATAAATACGTGAAATT 

ATGAGGTTAAGAAAAGTTTA 



/firs. 

ACATAAAATGTCGCTCAAAAACAATTATGTGTG TCTACACATATGGGAAA 
GCAGGAA ACAAATTTGTTTACAACATACATTACTTTTG HUM AGGCAAG 
ATAAAATNT CCTACCTCCAAAACCACCAGCA CNGTCCGCAATAACTATAC 
ATC 



AATATCATTCTTCACCCACGTTATAC ATAAGAGACCAGAATGTGATA TTGT 

CATCTCACATGGAAAAATCTGCTGTGATCAGTTCCTGAAGCTTGCTGTGA 

TCCTCCCTTAGGAAAGTAGAAAAATC I I' !' fTGAAACACTTTATTCTACAAT 

CAATGAAAATTAGGTGAAGCTACAGAAGCCAGAAATTACTCTAAGATTAG 

ACAATTATTTAAGANGACCAATTGTCTTTGGTCTTCTTCTGAAGGGTCTG 

ACTACCCTCCTCCAAAGAATTCACTGGCCGTCGTTTTACAACGTCNTGA 



GGAGGGTGTTNTCACANAAGTC TGGGGTGCGCTGTGTTGT TCATTGTAA 
AAACCCTTTGGANCATCTGGGAATGTGCTGCCCCACATGTCCAGGTAAC 
GTTCTCAGGAAGGGGAGGCTGGAAATCTCTGTGTGT TCTTACAGGAATG 
CATGj^AATCJCCCANCCCCTCTTGTTGGAAATTTCCCTCACTTT 
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CTTCTCNATGANTGGACAAATGTCATTGGGTCAGCATGAGGCACAGCTr 
ACCAGTTCA GATTCCAGTAGCTGAGGAACAAATCTTAACTCCAAAAATAA 
GTAATTGCGTCACTTTGGAGGAATTATTTGACCTTTTCATAACTTTGACAT 
CACAACAATGAGGGTGAAGTTAGTAAAATAAATGATTATTATGAGGATA5i 
AATGAGAAAATGAATT NAGTGCTTAAGACAATGCTTGGTAACTAGTTAAN 

CCG 



GGTNTT TCACTTGGTTGGTTAACATTACT TCT AAG 1 M il lATTGl 1 I H iA 
TGCTATTGCTAATGGGATTGCTTTCTTAATTTAI I 1 I I I CCAATAGCTTGT 
TGTTAGTTTATATCAAATGCAACTGTTTTTCTATGCAAATTATGTTTCCT 



TTGGTGGTGCCCTAGGTTTGGCAATTATAAATAAAGCTGCTACAAACATT 
CATGTGCAGGTCTCCGTGTGGACATAATTTTCCAGTTCATTTGGGTAAAA 
CCCAAGGGAGCACAACTG TTGGATCCTATNATAAAAATATNTC TCGTT TC 
ATTTAAAAAACCTGGGAAACTATCTNCCCACAGTGGCTGTCCCl I I 1 l GT 
ATnrrrAP.nAAn AATGTTGGAAAGCCTATTGCC ANCAT 



CATGNCTCACAGTGTTCTGAGGCTGCTCTGGACATGCAATCTTGCATGC 
TTTTGTCATGACAGGTCTTAAANAGTTTATCAGCTTNCTCAAATAGCTGAA 
TGACANAACACTGGATTTTTGTTCAAATANCCTATCAACTTGGCNTCTGT 
GTTGCGGTTGTCACTTGGTAACAAAATAAGTC 



TAATT G AC AAATAAAAATTG T AT ATTTT N C ATATTTAA C AT G TTATG C TAAC 
ATATATATGGA TTGTGGAATGGCTAAGT CAGAAATTCTTTTACATTCATAT 
7TCCATATTATTTACTTTNNGCTTTAAAAAATATGTAAATGANAATACTTAT 
TTTTTTnAnT^T nA^TGCCTTGATACTTTC ACATTTNNGTTACATATTATTT 
CCCTTNCATCTAACAAATATATATTGAGTTTCTATAATGTGTCTGACACTG 

A 

TGGTCACTGGTGCC TTATTTGGTTTGTTTGCTGAGGT CATATTTCCTGTG 
GCCTTCATGCTTGATTTGTTGGAGTCTAGCCATGTAAAANTCTGTTGGAG 
TCTAGGCATTTAAAAAATAGGTAmATTGTMTCTTTGCCAmGCTlGT 
TTGTATCCATCCTTCTTGGGAAGGCTTTACAGGCATTCAAAAGG 
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GCCAA CAM CAAAA TGAAA TAA GA CCTGGGA TG TA TTTTTTGGCCAA GGCAA TTA GAAA 
A TGA TTAGTA TCCTTA TCAGGAGCAA TTTCAGAGAA TGTTTGGGTGGA CGTCTAACTA CA 
GTGGA G TCAAACGTGAA TCAA CGG TGAAA AAA GGA CAA TAGCCAA TG TG TA CA C TTTTT 
A TAAAAACCA CCCTCCAAGGA CCA GGCA CTGGCCCTCTCTCCGG TGCCCACA GACA TC 
CA CA CA GGCCCAAA GAA TCA GGGA TTG CA CAA GCCA GAGCAA TCGAA CGG TTC TGA G T 
CA TCTGCCGGAA GCCTTGCCCTCAA TCAA GGCGGA CG TGAAGCA TCTA CAAA GGA GGA 
ATAGTCAAAGCAGCAGCGGCGGCGGCGGCGGCGGCAGCAGCAGCAGCAGCAGG 
AGGTGGGGGCCTCTGCCAGGTACCGGGCGGGGCAGGCACGGAGGTGCCCAGGTT 
CC CGCGG AGGCC ACCTCTTC CCTGGAGTGC GT GAG AG AGGGGAAGGG AGG AAGG 
CCAGAGCAGGAATCAGAGCGAGGCAAAGGCGGGCAGGAACTAXGAGAATGACS; 
GCGGGAGGCGGCCGGGAAAGAAA^TCTCGGGGCTGTGGGGGTCgDCCTGGCACC 
AGCCGGGGTCCCAAGCCCCACCGCGAGACCCCGCGA 



/7a /S6. 

5 ' - AT CGAACGGTTCTG AGTC ATCT 
5'-CGCTCTGATTCCTGCTCTG 



/7£ /&a. 

TTC A G TA GAA GGAA GCA CAGCAAA TTTGCCTTTA TAGAGA TTCAA TTCTTGGTGCTTGG 
GCCAAA GAA TAA GAA TTA CA TTAA GCA GGCCGGGCA CGG TGGCTCA CA CC TG TAAAA C 
CA GAA CTTTGG GA GGCCGA GGCA GGCA GA TCA TGA GG TCA GGA GA TCGA GA CCA TCC 
TGGA CAA CA TA G TGAAA CCCCA TCTCTA CTAAAAA TA CAAAAA TTAGCCGGGCA TGGTG 
GTGCA TGCCTG TAA TCCCA GCTACTCA GGA GGCGGA GGCA GGA GAA TCCCTTGAA CCA 
GGGAGTTGGA GGTTGCAGTGA GCCGAGA TCA CGCCACA GCA CTCTA GCCTGGCGACA 
GAGTGAGACTCCA TCTCAAAAAAAAAAAAAAAAAAAAAAAAA TTACA TTAAGCAGCAGC 
AGCAGCAGTGASAGAGGGAAKAATGAAAGAAGAAATTTCTAGAATAAGATTGA 
TCTCCAGCACCATGCCAATCATGGACTGGATACAATTCATGCATATCTTTTGTGA 
GAGAGGTGAGAGATGTGAATCCTTTCTCATT 



/7£/^J>. 

5 '-AGAAGGAAGC AC AGCAAATTTG 
5 '-GCATGGTGCTGGAGATCAAT 
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TGGGA G TTAAA GCA GA CA TTCGGCTTTNG TG TTGCCA GA GTTCTAA C ATA AG TTCTTTTT 
CA rCTGGGCAGGCNGA TGTTCCTTCCA TCTTNGAA GNA CNGTCCTTTTCA TTTTTTTTA T 
TTNGCTTTTGGSKTTTA TCTTCTTA GA CGTCTTCA GGAGTTKGA TTG TA GKGTAA GGCAG 
ATTTAGTTGACTGGGC7TTGTTTCTGGAAAATTTTAAAGGGGCAAGTCCTGGGCTGCAT 
A TTCTTA CTCTGGGGGCTTA GTACTGGCCCCTAAA TTTG TTCTCTGGCTCCTCAA GG 77 
AG AAA TCTGCTGGCTGGA GGGGCTGA GA TGTTCCTTGA CTGCTGGCCA GAA CA TfCCG 
CCGGGGGGTGGCAA CCGAA G TGTTTCTTTGGGCAA TGGCAGCAGAA TTCA TGA TTGTT 
TTCATGTRCCAGCAGCAG7GGCAGCGCAKJGAGTTGCATGAT1GTTGGCTGGGGC 
TGAGTGCTGGCASGCACTGGAGTGTTTGGCTTCCAGTAGAAATTCACAGCAGTAG 
TAGTGGTGGCATGGGAAGGAGGGCAGYGGTGGCATGGGGAGGACCCCCC 



/7£ 

5 '-GGCTGAGATGTTCCTTGACTGC 
5'- CCTTCCCATGCCACCACTACTA 



TG TAA TTCCCA GCAA TJTGGGGA GCCCAA GGCGGGCA GA TTCA TGAGTTCGGGAA GA T 
TCGAGACCNTTCCTGGCTAAACACGGGGGAAACCCCNTTTTTACTAAAAAATACCAAAA 
AA TTAA CCTGGGCG TGG TGGCGGGCCCCA GCTA NTCCG GA GGC TGA GGCA GGA GAA T 
GGTG TGAA CCCGGGA GGCGGA GC TTGCA G TGA GCCGA GA TCC 'CGCTA C TGCA C TCCA 
GCCTGGGCAA TAGAGGGA GA CTCCGTCTCAAAAAAAAAAAAAAA TAAA TAA TAA TAAAA 
AAAA TAACAA TAA TAA TACTAA TAA TTGCTTGA TA TUTACAAAAGCAAAAGGAAAA GAAG 
ACTAGGCAAGAAAAAAAAAACCTCCTTAGA TGGTAGAACTCAGGTTTAAAA TTAAAACTT 
ATTCTGGTGTCAGSCTAGTTGTA TATTTTGACCTCTTTAAATGCTCTGAACTA TGA TA TGG 
AGTAACAGCGATGCTGCTGCTGC1GCTGCTGCTGCTGATGGTGGTGGTGTTTTA 
ATATCGAATAAAAGTTGTGGA,AACTAAATTTCATTTCTGCCAATTA.^CTAAGATT 
GCAAAGTTAAACATCT 



/7# /#/>. 

5 ' -TTTGC AATCTT AGTT AATTGG C 
5'-GAACTATGATATGGAGTAACAGCG 
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29384 


Timothy J. Oyer 


. 36,628 


Robert H. Walat 




George L. Greenfield 


17,756" 


Edward F. Perlman 


28,105 


Lisa E. Winsor 


_44 a 405_ 


James M. Hanifm, Jr. 


39,21? 


Michael J. Pomianek 


46,190 


David Wolf 


^17,528. 


Therese A. Hendricks 


—30,389 


Elizabeth R. Plumer 




Douglas R. Wolf 




Steven J. Henry 


* 27,90(5 


Randy J. Pritzker 


35,986 








Robert E. Rigby, Jr. 


36,904 










Edward J. Russavage 


43,069 
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Address all telephone calls to at telephone no. (617) 720-3500. Address all correspondence to: 

v l-Elizaheth^Hiimer 
c/o telf^Greenfr^^ P.C., 
Vj^eralJBLeserve Plaza 

( 600 Atlantic A venue_ 
|Boston ? MA 0221032 1 1 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under -Section 1001 of Title 18 of the United States 
Code and that such willful false statements mayjeopardize the validity of the application or any 
patent issued thereon. 




BROECKHOVEN 



Inventor's signature , - 

Full name first inventor: ^v ^Gfinstine^ 
Citizenship: . JCAj Belgian " ~ ~ 

Residence: t Edeg gmJ3elbium \z> c cX. 

Post Office Address: Koning AJbertlei 1 5, B-2650, 



Date 



Edege 



Inventor's signature 

Full name first inventor: ^ 
Citizenship: ^^Xj^ 
Residence: ^ 
Post Office Address: 




Date 1 



Peter RAEYMAE KgRS 
Belgian 

Lint, Belgium (3 
TS3inaal Cardijnlaan 104, B- 
2547 Lint, Belgium 




Inventor's signature 

Full name first inventor: .<r\ 
Citizenship: ^-O^ 
Residence: 
Post Office Address: 



Date 



Jmg en DEL-F AVE RQ 
"^elgiarT vf 

Tienen , Belgium ' — ^ 

Oorbeeksesteenweg 149, B-3300, 

Tienen, Belgium 
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